fast-tagsoup-utf8-only
Version 1.0.5 revision 0 uploaded by MikhailKuddah.
Package meta
- Synopsis
- Fast parser for tagsoup package
- Description
Fast TagSoup parser. Speeds of 20-200MB/sec were observed.
Works only with strict bytestrings.
This library is intended to be used in conjunction with the original
tagsoup
package:import Text.HTML.TagSoup hiding (parseTags, renderTags) import Text.HTML.TagSoup.Fast.Utf8Only
Besides speed
fast-tagsoup
correctly handles HTML<script>
and<style>
tags and converts tags to lower case. This fork purposefully removes support for parsing non-utf8 documents, to avoid dependency on text-icu. If you need to handle other encodings, refer to the original http://hackage.haskell.org/package/fast-tagsoupThis parser is used in production in BazQux Reader feeds and comments crawler.
- Author
- Vladimir Shabanov <vshabanoff@gmail.com>
- Bug reports
- n/a
- Category
- XML
- Copyright
- Vladimir Shabanov 2011-2012
- Homepage
- https://github.com/exbb2/fast-tagsoup
- Maintainer
- Vladimir Shabanov <vshabanoff@gmail.com>
- Package URL
- n/a
- Stability
- n/a