version 1.14, 1999/06/25 07:45:18
|
version 1.15, 1999/07/01 08:12:23
|
Line 31
|
Line 31
|
</H1> |
</H1> |
<P> |
<P> |
<A href="#Perl">Perl source</A> | <A HREF="#Recommended">Recommended Character |
<A href="#Perl">Perl source</A> | <A HREF="#Recommended">Recommended Character |
Data File</A> | <A HREF="#How">How to use</A> | Future Plans | Background |
Data File</A> | <A HREF="#How">How to use</A> | <A HREF="#Future">Future |
| Version History |
Plans</A> | Background | Version History |
<H3> |
<H3> |
<A NAME="Perl">Perl Source</A> and Installation |
<A NAME="Perl">Perl Source</A> and Installation |
</H3> |
</H3> |
Line 62 send any comments on <EM>the data file</
|
Line 62 send any comments on <EM>the data file</
|
<A NAME="How">How to use charlint</A> |
<A NAME="How">How to use charlint</A> |
</H3> |
</H3> |
<P> |
<P> |
Charlint |
Charlint is a perl script that works as a simple filter. It uses UTF-8 both |
<PRE>Available options: |
for input and for output. Behaviour can be fine-tuned with various options. |
|
A list of options as the one below can be optained by using <KBD>charlint |
(options prefixed by # are currently not available) |
-h</KBD>. |
|
<PRE>(options prefixed by # are currently not available) |
-b: Remove initial 'Byte Order Mark' |
-b: Remove initial 'Byte Order Mark' |
-B: Supress warning about initial 'Byte Order Mark' |
-B: Supress warning about initial 'Byte Order Mark' |
-d: Debug: Thoroughly check character data table input |
-d: Debug: Thoroughly check character data table input |
Line 85 Charlint
|
Line 86 Charlint
|
-u: # Fix UTF-8 (convert or remove) |
-u: # Fix UTF-8 (convert or remove) |
-U: Supress checking correctness of UTF-8 |
-U: Supress checking correctness of UTF-8 |
-v: Print version |
-v: Print version |
|
|
</PRE> |
</PRE> |
<H3> |
<H3> |
|
<A NAME="Future">Future Plans</A> |
</H3> |
</H3> |
<P> |
<P> |
|
We have just released the first version of charlint. There are many things |
|
we plan to add in the future: |
|
<UL> |
|
<LI> |
|
Hangul syllable normalization |
|
<LI> |
|
Removal of undefined codepoints and codepoints in the private zone |
|
<LI> |
|
Removal/fix of incorrect UTF-8 |
|
</UL> |
|
<P> |
<HR> |
<HR> |
<ADDRESS> |
<ADDRESS> |
<A HREF="mailto:duerst@w3.org">Martin Dürst</A> <BR> |
<A HREF="mailto:duerst@w3.org">Martin Dürst</A> <BR> |