summaryrefslogtreecommitdiff
path: root/Libraries/LibLine
diff options
context:
space:
mode:
authorAndreas Kling <kling@serenityos.org>2020-05-22 21:46:13 +0200
committerAndreas Kling <kling@serenityos.org>2020-05-22 21:46:13 +0200
commit272b35d2e109b5ad17af2e3b62e693da4bbc8440 (patch)
tree602b42d4086cb73a42e12a88eab9503bd80b0d60 /Libraries/LibLine
parent120cd44011f15bb101a94d181822b1de8eb82d73 (diff)
downloadserenity-272b35d2e109b5ad17af2e3b62e693da4bbc8440.zip
LibWeb: Begin work on a spec-compliant HTML parser
In order to actually view the web as it is, we're gonna need a proper HTML parser. So let's build one! This patch introduces the Web::HTMLTokenizer class, which currently operates on a StringView input stream where it fetches (ASCII only atm) codepoints and tokenizes acccording to the HTML spec tokenization algo. The tokenizer state machine looks a bit weird but is written in a way that tries to mimic the spec as closely as possible, in order to make development easier and bugs less likely. This initial version is far from finished, but it can parse a trivial document with a DOCTYPE and open/close tags. :^)
Diffstat (limited to 'Libraries/LibLine')
0 files changed, 0 insertions, 0 deletions