Safe Haskell | Safe |
---|---|
Language | Haskell98 |
Parse a string into our custom tag soup data structure.
The parser works only on properly decoded Unicode text.
That is, you must decode the input beforehand,
e.g. using the decoding functions from the hxt or encoding packages.
findMetaEncoding
can help you retrieve the character set encoding
from the meta information of the document at hand.
- class C char => CharType char
- runSoup :: (C source, StringType sink, Attribute name, Tag name) => source -> [T name sink]
- runSoupWithPositions :: (C source, StringType sink, Attribute name, Tag name) => source -> [T name sink]
- runSoupWithPositionsName :: (C source, StringType sink, Attribute name, Tag name) => FilePath -> source -> [T name sink]
- runTag :: (C source, StringType sink, Show sink, Attribute name, Tag name, Show name) => source -> T name sink
- runInnerOfTag :: (StringType sink, Show sink, Attribute name, Tag name, Show name) => String -> T name sink
Documentation
runSoup :: (C source, StringType sink, Attribute name, Tag name) => source -> [T name sink]
Like runSoupWithPositions but hides source file positions.
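A minimal sketch of calling runSoup on an already decoded String. The result type is polymorphic in both name and sink, so a type annotation is needed to pick them. The qualified module names used here (Text.HTML.Tagchup.Tag for the tag type, Text.XML.Basic.Name.LowerCase from the xml-basic package for the name type) and the helper parseHtml are assumptions that this page does not state.

```haskell
import qualified Text.HTML.Tagchup.Parser as Parser
import qualified Text.HTML.Tagchup.Tag as Tag
import qualified Text.XML.Basic.Name.LowerCase as Name

-- Hypothetical helper: fix the polymorphic result of runSoup to
-- lower-case tag names (assumed name type) and plain String text.
parseHtml :: String -> [Tag.T Name.T String]
parseHtml = Parser.runSoup

main :: IO ()
main = do
   let tags = parseHtml "<p>Hello <b>world</b></p>"
   -- Report how many tags were produced and how many of them are open tags.
   putStrLn ("tags: " ++ show (length tags))
   putStrLn ("open tags: " ++ show (length (filter Tag.isOpen tags)))
```

Choosing String as the sink keeps the example simple; other sink types only need a StringType instance.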
runSoupWithPositions :: (C source, StringType sink, Attribute name, Tag name) => source -> [T name sink]
Parse an HTML document to a list of T.
Automatically expands escaped characters.
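The expansion of escaped characters can be observed by collecting the text tags of a parsed document. The sketch below uses runSoup (the variant without positions) to keep the types simple. It assumes that Tag.maybeText from Text.HTML.Tagchup.Tag extracts the content of a text tag and that Text.XML.Basic.Name.LowerCase provides the name type; plainText is a hypothetical helper, not part of the library.

```haskell
import Data.Maybe (mapMaybe)
import qualified Text.HTML.Tagchup.Parser as Parser
import qualified Text.HTML.Tagchup.Tag as Tag
import qualified Text.XML.Basic.Name.LowerCase as Name

-- Hypothetical helper: concatenate the contents of all text tags,
-- dropping open tags, close tags, comments and so on.
plainText :: String -> String
plainText input =
   let tags :: [Tag.T Name.T String]
       tags = Parser.runSoup input
   in  concat (mapMaybe Tag.maybeText tags)

main :: IO ()
main =
   -- The input contains the escaped ampersand "&amp;".
   putStrLn (plainText "<p>Fish &amp; chips</p>")
```

With a String sink the text cannot carry an unexpanded reference, so the printed text is the place to check how the escape was handled.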
runSoupWithPositionsName :: (C source, StringType sink, Attribute name, Tag name) => FilePath -> source -> [T name sink]