tagsoup

Version 0.14.8 revision 0 uploaded by NeilMitchell.

Package meta

Synopsis
Parsing and extracting information from (possibly malformed) HTML/XML documents
Description

TagSoup is a library for parsing HTML/XML. It supports the HTML 5 specification and can parse either well-formed XML or unstructured, malformed HTML from the web. The library also provides functions for extracting information from an HTML document, which makes it well suited to screen-scraping.

Users should start from the Text.HTML.TagSoup module.
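
As a rough illustration of the screen-scraping use described above, the following minimal sketch parses a deliberately malformed HTML fragment and pulls out the link targets. It uses parseTags, isTagOpenName, fromAttrib and innerText from Text.HTML.TagSoup; the helper name extractLinks and the sample input are hypothetical and are not part of the package.

```haskell
module Main (main) where

import Text.HTML.TagSoup (fromAttrib, innerText, isTagOpenName, parseTags)

-- Hypothetical helper (not part of tagsoup): collect the href value of
-- every opening <a> tag, skipping anchors that carry no href attribute.
extractLinks :: String -> [String]
extractLinks html =
  [ href
  | tag <- parseTags html          -- parse into a flat list of tags
  , isTagOpenName "a" tag          -- keep only opening <a ...> tags
  , let href = fromAttrib "href" tag
  , not (null href)                -- fromAttrib yields "" when the attribute is absent
  ]

main :: IO ()
main = do
  -- Assumed sample input: the <b> and <p> elements are never closed.
  let html = "<p>See <a href=\"https://example.org\">this <b>page</a> for details"
  mapM_ putStrLn (extractLinks html)
  -- innerText concatenates the text nodes and drops the markup.
  putStrLn (innerText (parseTags html))
```

Because parseTags produces a flat list of tags rather than a tree, unbalanced markup like the sample above does not need special handling; filtering the list is usually enough for simple extraction tasks.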

Author
Neil Mitchell <ndmitchell@gmail.com>
Bug reports
https://github.com/ndmitchell/tagsoup/issues
Category
XML
Copyright
Neil Mitchell 2006-2019
Homepage
https://github.com/ndmitchell/tagsoup#readme
Maintainer
Neil Mitchell <ndmitchell@gmail.com>
Package URL
n/a
Stability
n/a

Components