The latest version of fast-tagsoup-utf8-only is 1.0.5-0.

fast-tagsoup-utf8-only

Version 1.0.4 revision 0 uploaded by MikhailKuddah.

Package meta

Synopsis
Fast parser for tagsoup package
Description

Fast TagSoup parser. Speeds of 20-200MB/sec were observed.

Works only with strict bytestrings.

This library is intended to be used in conjunction with the original tagsoup package:

import Text.HTML.TagSoup hiding (parseTags, renderTags)
import Text.HTML.TagSoup.Fast.Utf8Only

Besides speed fast-tagsoup correctly handles HTML <script> and <style> tags and converts tags to lower case. This fork purposefully removes support for parsing non-utf8 documents, to avoid dependency on text-icu. If you need to handle other encodings, refer to the original http://hackage.haskell.org/package/fast-tagsoup

This parser is used in production in BazQux Reader feeds and comments crawler.

Author
Vladimir Shabanov <vshabanoff@gmail.com>
Bug reports
n/a
Category
XML
Copyright
Vladimir Shabanov 2011-2012
Homepage
https://github.com/vshabanov/fast-tagsoup
Maintainer
Vladimir Shabanov <vshabanoff@gmail.com>
Package URL
n/a
Stability
n/a

Components