3 - limit the length of markup elements that never end. Perhaps by
4 configurable limits on the length that markup can have and still
5 be recongnized. Report stuff as 'text' when this happens?
6 - remove 255 char limit on literal argspec strings
7 - implement backslash escapes in literal argspec string
8 - <![%app1;[...]]> (parameter entities)
9 - make literal tags configurable. The current list is hardcoded
10 to be ""script", "style", "title", "textarea", "xmp", and "plaintext".
12 SGML FEATURES WE WILL PROBABLY IGNORE FOREVER
13 - Empty tags: <> </> (repeat previous start tag)
14 - <foo<bar> (same as <foo><bar>)
18 POSSIBLE OPTIMIZATIONS
19 - none that I can think of right now
23 MINOR "BUGS" (alias FEATURES)
24 - no way to clear "boolean_attribute_value".
25 - <style> and <script> do not end with the first "</".
28 MSIE bug compatibility
29 - recognize server side includes as comments; <% ... %>
30 if no matching %> found tread "<% ..." as text
31 - skip quoted strings when looking for PIC