Wow, the state of the art for parsing HTML with the Python standard library is... pretty far back, huh

@xor I'd just use BeautifulSoup4 with the lxml backend


@divergentdave as a challenge, I'm trying to stick to the standard library, and hoo boy, it is way less good

(I got it to work with regex but like...)
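
For anyone who wants the standard-library route without regex, here is a rough sketch using `html.parser.HTMLParser`. The task (collecting link targets) and the names `LinkCollector` and `extract_links` are assumptions for illustration; the thread doesn't say what was actually being parsed.

```python
from html.parser import HTMLParser


class LinkCollector(HTMLParser):
    """Collect href values from <a> tags (hypothetical helper, not from the thread)."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag and attribute names before calling this.
        # attrs is a list of (name, value) pairs; value is None for bare attributes.
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value is not None:
                    self.links.append(value)


def extract_links(html_text):
    """Return every href found on an <a> tag, in document order."""
    parser = LinkCollector()
    parser.feed(html_text)
    parser.close()
    return parser.links
```

It's event-based, so there's no tree to query: you get callbacks for start tags, end tags, and text, and have to track any nesting yourself. That is most of why it feels dated next to BeautifulSoup4.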

@xor (extremely xkcd voice) the HTML5 spec, with its standardization of parsing, postdates Python 3 by six years!
