Annotation of XML/TODO, revision 1.12

1.1       daniel      1: 
                      2:            TODO for the XML parser:
                      3: 
1.12    ! daniel      4: - Language identification code, productions [33] to [38]
        !             5: 
        !             6: - Conditional sections in DTDs [61] to [65]
        !             7: 
        !             8: - External Parsed entities, either XML or external Subset [78] and [79]
        !             9: 
1.11      daniel     10: - Collect IDs at parsing and maintain a table. PBM: maintain the table coherency
                     11: 
                     12: - Use it for XPointer id()
                     13: 
1.10      daniel     14: - Start adding validation, now that the DTD are fully parsed.
                     15: 
1.7       daniel     16: - Support for UTF-8 and UTF-16 encoding (Urgent !!!).
1.8       daniel     17:   => added some convertion routines provided by Martin Durst but I didn't
                     18:      try to glue them in. I plan to keep everything internally as UTF-8
                     19:      this is slightly more costly but more compact, and recent processors
                     20:      efficiency is cache related. The key for good performances is keeping
                     21:      the data set small, so will I.
                     22: 
1.7       daniel     23: - progressive parsing. The entity support is a first step toward
                     24:   asbtraction of an input stream. A large part of the context is still
                     25:   located on the stack, moving to a state machine and putting everyting
                     26:   in the parsing context should provide an adequate solution.
1.8       daniel     27:   => Rather than progressive parsing, give more power to the SAX-like
                     28:      interface. Currently the DOM-like representation is built but
                     29:      it should be possible to define that only as a set of SAX callbacks
                     30:      and remove the tree creation from the parser code.
                     31: 
1.1       daniel     32: - DOM support, instead of using a proprietary in memory
                     33:   format for the document representation, the parser should
                     34:   call a DOM API to actually build the resulting document.
                     35:   Then the parser becomes independent of the in-memory
                     36:   representation of the document. Even better using RPC's
                     37:   the parser can actually build the document in another
                     38:   program.
1.8       daniel     39:   => Work started, now the internal representation is by default
                     40:      very near a direct DOM implementation. The DOM glue is implemented
                     41:      as a separate module. See gnome-dom !
1.1       daniel     42: 
1.2       daniel     43: Done:
                     44: - C++ support : John Ehresman <jehresma@dsg.harvard.edu>
1.6       daniel     45: - Updated code to follow more recent specs, added compatibility flag
1.7       daniel     46: - Better error handling, use a dedicated, overridable error
                     47:   handling function.
                     48: - Support for CDATA.
                     49: - Keep track of line numbers for better error reporting.
                     50: - Support for PI (SAX one).
1.8       daniel     51: - Support for Comments (bad, should be in ASAP, they are parsed
                     52:   but not stored), should be configurable.
                     53: - Improve the support of entities on save (+SAX).
1.2       daniel     54: 
1.12    ! daniel     55: $Id: TODO,v 1.11 1999/03/07 10:05:54 daniel Exp $

Webmaster