beautypg.com

Generating, Pdf to html, xml, plain text, Html files from pdf – Adobe Acrobat 7 Professional User Manual

Page 178: Bookmarks, Images, Html, Plain text, Converting, Html and xml. (see, Conversion options for

background image

Conversion options for HTML, XML, or plain text format

By default, images are converted to JPEG format.

Encoding

Refers to the binary values, based on international standards, used to represent the text
characters. UTF-8 is a Unicode representation of characters using one or more 8-bit bytes
per character. UTF-16 is a Unicode representation of characters using one or more 16-bit
bytes per character. ISO-Latin-1 is an 8-bit representation of characters that is a superset
of ASCII. UCS-4 is a Universal Character Set coded in 4 octets. HTML/ASCII is a 7-bit
representation of characters developed by ANSI.

Use Mapping Table Default uses the default character encoding defined in mapping
tables, which appear in the Plug-ins/SaveAsXML/MappingTables folder. These mapping
tables specify many characteristics of how the data is output, including the default
character encoding. These defaults are:

Save as XML: UTF-8

Save as Text: Host encoding, which is defined by the operating system, based on its locale
setting

Save as HTML 3.0: HTML/ASCII

Save as HTML 4.0.1: UTF-8

Generate Bookmarks

Generates bookmark links to content for HTML or XML documents. Links are placed at
the beginning of the resulting HTML or XML document.

Generate Tags For Untagged Files

Generates tags for files that are not already tagged, such as PDF files created using
Acrobat 4.0 or earlier. If this option is not selected, untagged files are not converted.

Note: Tags are applied only as part of the conversion process and are discarded after the
conversion. This is not a method for creating tagged PDF files from legacy files.

Generate Images

Controls how images are converted. Converted image files are referenced from within
XML and HTML documents.

Use Sub-Folder

Specify the name of the folder in which to store generated images. The default is Images.

Use Prefix

You can specify a prefix to be added to the image file names in case you have several
versions of the same image file. File names assigned to images have the format
filename_img_#.

Output Format

The default is JPG.

Downsample To

If you do not select this option, image files have the same resolution as in the source file.
Image files are never upsampled.

This manual is related to the following products: