diff options
Diffstat (limited to 'libs/bs4/TODO.txt')
-rw-r--r-- | libs/bs4/TODO.txt | 31 |
1 files changed, 0 insertions, 31 deletions
diff --git a/libs/bs4/TODO.txt b/libs/bs4/TODO.txt deleted file mode 100644 index e26d6264d..000000000 --- a/libs/bs4/TODO.txt +++ /dev/null @@ -1,31 +0,0 @@ -Additions ---------- - -More of the jQuery API: nextUntil? - -Optimizations -------------- - -The html5lib tree builder doesn't use the standard tree-building API, -which worries me and has resulted in a number of bugs. - -markup_attr_map can be optimized since it's always a map now. - -Upon encountering UTF-16LE data or some other uncommon serialization -of Unicode, UnicodeDammit will convert the data to Unicode, then -encode it at UTF-8. This is wasteful because it will just get decoded -back to Unicode. - -CDATA ------ - -The elementtree XMLParser has a strip_cdata argument that, when set to -False, should allow Beautiful Soup to preserve CDATA sections instead -of treating them as text. Except it doesn't. (This argument is also -present for HTMLParser, and also does nothing there.) - -Currently, htm5lib converts CDATA sections into comments. An -as-yet-unreleased version of html5lib changes the parser's handling of -CDATA sections to allow CDATA sections in tags like <svg> and -<math>. The HTML5TreeBuilder will need to be updated to create CData -objects instead of Comment objects in this situation. |