Plain text, Character sets – Sony PEG-NZ90 User Manual
Page 4

Picsel Text File Format Support
Page
4
ePAGE supports the most popular global file formats. The formats
interpreted by ePAGE are richly expressive, containing not just text but
sophisticated layout and rendering features, rich fonts, colour, images,
tables, graphics and many other document features. Picsel is
continuously developing its file format support to eventually cover every
feature of the native file. With such a wealth of features across many
document types, this is inevitably an ongoing process, with milestone
releases of new functionality planned at periodic intervals. The
approach involves researching the feature set most commonly found in
real documents, building support early for the most frequently used
elements, and ensuring these features are reproduced with total
faithfulness to the original. The emphasis of ePAGE is on displaying rich
content rather than on reproducing the document creation facilities of
the original application.
This document describes the features supported in ePAGE. This level of
support already covers the vast majority of characteristics that occur in
day to day documents of this type, and the specific features are
described with notes where appropriate. Those features planned for
future implementation are also described, for completeness.
Plain Text
While there are many document formats which encode extra
information such as page size and pictures into a single file, the most
basic commonly used file format for text is called a plain text file. This
has no special file structure; from the first byte to the last is a seqeuence
of characters. Some of these characters do not appear visually, but
instead take new lines or indent to a tab stop, as defined in the ASCII
standard (ANSI X3.4, initially approved in 1969).
Feature Support
Notes
Plain 7-bit ASCII text
Yes
Latin1 text
Yes
Newlines Yes
Tabs
Yes
Converted to sequences of spaces
Unlimited page size
Yes
Character Sets
ePAGE can work with all characters in the Unicode specification. Text
documents are encoded in other encodings, and ePAGE handles these
by converting Latin1 to UTF-16 Unicode. (In addition to the encoding