Annotation of XML/TODO, revision 1.14
1.1 daniel 1:
2: TODO for the XML parser:
3:
1.14 ! daniel 4: - API to search the lang of an attribute
! 5:
1.13 daniel 6: - Optimization of tag strings allocation.
7:
1.12 daniel 8: - Language identification code, productions [33] to [38]
9:
10: - Conditional sections in DTDs [61] to [65]
11:
12: - External Parsed entities, either XML or external Subset [78] and [79]
1.14 ! daniel 13: (on the workbench) parsing the xmllang DtD seems to end up in an
! 14: infinite loop though...
1.12 daniel 15:
1.11 daniel 16: - Collect IDs at parsing and maintain a table. PBM: maintain the table coherency
17:
1.14 ! daniel 18: - Use it for XPath ID support
1.11 daniel 19:
1.10 daniel 20: - Start adding validation, now that the DTD are fully parsed.
21:
1.7 daniel 22: - Support for UTF-8 and UTF-16 encoding (Urgent !!!).
1.8 daniel 23: => added some convertion routines provided by Martin Durst but I didn't
24: try to glue them in. I plan to keep everything internally as UTF-8
25: this is slightly more costly but more compact, and recent processors
26: efficiency is cache related. The key for good performances is keeping
27: the data set small, so will I.
28:
1.7 daniel 29: - progressive parsing. The entity support is a first step toward
30: asbtraction of an input stream. A large part of the context is still
31: located on the stack, moving to a state machine and putting everyting
32: in the parsing context should provide an adequate solution.
1.8 daniel 33: => Rather than progressive parsing, give more power to the SAX-like
34: interface. Currently the DOM-like representation is built but
35: it should be possible to define that only as a set of SAX callbacks
36: and remove the tree creation from the parser code.
37:
1.14 ! daniel 38:
! 39: Done:
1.1 daniel 40: - DOM support, instead of using a proprietary in memory
41: format for the document representation, the parser should
42: call a DOM API to actually build the resulting document.
43: Then the parser becomes independent of the in-memory
44: representation of the document. Even better using RPC's
45: the parser can actually build the document in another
46: program.
1.8 daniel 47: => Work started, now the internal representation is by default
48: very near a direct DOM implementation. The DOM glue is implemented
1.14 ! daniel 49: as a separate module. See the gdome module.
1.2 daniel 50: - C++ support : John Ehresman <jehresma@dsg.harvard.edu>
1.6 daniel 51: - Updated code to follow more recent specs, added compatibility flag
1.7 daniel 52: - Better error handling, use a dedicated, overridable error
53: handling function.
54: - Support for CDATA.
55: - Keep track of line numbers for better error reporting.
56: - Support for PI (SAX one).
1.8 daniel 57: - Support for Comments (bad, should be in ASAP, they are parsed
58: but not stored), should be configurable.
59: - Improve the support of entities on save (+SAX).
1.2 daniel 60:
1.14 ! daniel 61: $Id: TODO,v 1.13 1999/07/08 18:33:41 daniel Exp $
Webmaster