FileSize | 14984 |
MD5 | A0906E611021E54CA40B29A60B422C31 |
PackageDescription | simple CSS1 parser library for C
htmlcxx is a simple non-validating CSS1 and HTML parser for C++. Although
there are several other html parsers available, htmlcxx has some
characteristics that make it unique:
.
* STL like navigation of DOM tree, using excellent tree.hh library from
Kasper Peeters
* It is possible to reproduce exactly, character by character, the original
document from the parse tree
* Bundled CSS parser
* Optional parsing of attributes
* C++ code that looks like C++ (not so true anymore)
* Offsets of tags/elements in the original document are stored in the nodes
of the DOM tree
.
The parsing politics of htmlcxx were created trying to mimic Mozilla Firefox
(http://www.mozilla.org) behavior. So you should expect parse trees similar to
those create by Firefox. However, differently from Firefox, htmlcxx does not
insert non-existent stuff in your html. Therefore, serializing the DOM tree
gives exactly the same bytes contained in the original HTML document.
.
This package contains the C runtime library for CSS parsing. |
PackageMaintainer | Ubuntu Developers <ubuntu-devel-discuss@lists.ubuntu.com> |
PackageName | libcss-parser0 |
PackageSection | libs |
PackageVersion | 0.85-3 |
SHA-1 | 69F91F7BD69D07B939DF037FA53485D86A384F4C |
SHA-256 | 5B2B998049CD5ED0D6756D8FF53B0BB76BAA2F8AF34FEE4D61AE1334BBA77688 |
hashlookup:children-total | 3 |
hashlookup:trust | 50 |