  • By Samuel Huang
    Finding and Fixing Website Link Rot with Python, BeautifulSoup and Requests

    When hyperlinks go dead by returning 404 or 500 HTTP status codes, or start redirecting to spam websites, that is the awful phenomenon known as “link rot”. Link rot is a widespread problem; in fact, research shows that the average link lasts four years.

    In this blog post, we will look at how link rot affects user experience, using Full Stack Python as our example. We’ll build a Python script that detects link rot in Markdown and HTML files so we can quickly find and fix our issues.
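
To make the idea concrete, here is a minimal sketch of what such a detector could look like. It is not the script from this post: to stay self-contained it swaps in the standard library (`re`, `html.parser`, `urllib`) where the post uses BeautifulSoup and Requests. The names `extract_links`, `fetch_status` and `find_rotten_links` are hypothetical. The sketch assumes inline Markdown links of the form `[text](url)` and only looks at absolute http(s) URLs.

```python
import re
import urllib.error
import urllib.request
from html.parser import HTMLParser

# Assumption: only inline Markdown links such as [text](https://example.com)
# are matched; reference-style links and URLs containing ")" are not handled.
MARKDOWN_LINK = re.compile(r"\[[^\]]*\]\((https?://[^)\s]+)\)")


class LinkCollector(HTMLParser):
    """Collects absolute http(s) href values from <a> tags."""

    def __init__(self):
        super().__init__()
        self.urls = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        for name, value in attrs:
            if name == "href" and value and value.startswith(("http://", "https://")):
                self.urls.append(value)


def extract_links(text, is_html):
    """Return the absolute links found in a Markdown or HTML string."""
    if is_html:
        collector = LinkCollector()
        collector.feed(text)
        return collector.urls
    return MARKDOWN_LINK.findall(text)


def fetch_status(url, timeout=10):
    """Return the HTTP status code for url, or None if there was no response."""
    request = urllib.request.Request(
        url, method="HEAD", headers={"User-Agent": "link-rot-check"}
    )
    try:
        with urllib.request.urlopen(request, timeout=timeout) as response:
            return response.status
    except urllib.error.HTTPError as error:
        # 4xx and 5xx responses arrive as exceptions that carry the status code.
        return error.code
    except (urllib.error.URLError, OSError, ValueError):
        # DNS failures, refused connections, timeouts, malformed URLs.
        return None


def find_rotten_links(urls, get_status=fetch_status):
    """Return (url, status) pairs for links that error out or do not respond."""
    rotten = []
    for url in urls:
        status = get_status(url)
        if status is None or status >= 400:
            rotten.append((url, status))
    return rotten
```

A typical use would be to read each `.md` or `.html` file, pass its text to `extract_links`, and feed the result to `find_rotten_links`. This sketch only flags error statuses and unreachable hosts. It follows redirects silently, so it cannot detect a link that now redirects to a spam site, and some servers reject `HEAD` requests, so a flagged link is worth re-checking by hand.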

    A Link Rot Example

    fullstackpython.com is a website created by Twilio employee Matt Makai in 2012. The site has helped many folks, including me, learn how to best use Python and the tools within its ecosystem.

    The site now has over 145,000 words and 150 pages, including:

    • 2400+ links in the repository
    • 300+ HTML files
    • 150+ Markdown files

    And there ...
