[Haskell-cafe] HTML library with DOM?

Gregory Collins greg at gregorycollins.net
Thu Oct 7 08:41:19 EDT 2010


Michael Snoyman <michael at snoyman.com> writes:

> As far as I know, Neil Mitchel's tagsoup[1] parses according to the
> HTML 5 parsing rules, but it just generates a list of Tags[2], so
> you'd have to build the DOM tree up from there. I personally have had
> great experience with tagsoup. It's even the core of HTML-scraping
> technology powering searchonce[3].

Yep, someone else wrote me privately to say this (that tagsoup respects
the html5 lexing rules). So I'll be using this as the basis of an html5
DOM parser. Stay tuned!

G
-- 
Gregory Collins <greg at gregorycollins.net>


More information about the Haskell-Cafe mailing list