HTML Parser settings

The HTML parser rules tell WorldServer how to process the elements found in HTML documents and what content to extract for translation.

The HTML Parser uses a list of predefined rules for the most common HTML elements. These default rules specify what content in the file is inline information and should be translated, and what content is structural information and should not be translated in the Editor. On the Parser page you can customize the existing parser rules or add new ones in order to extract more or less content from the HTML document.
The Parser Rules page
This page contains the list of existing parser rules, and has buttons for adding, deleting and modifying the rules. You cannot edit the rules directly on the Parser page. Click Add... , Edit... or Copy to display the dialog box that allows you change the parser rules and their settings. For mor information, see the Add/Edit/Copy Rule Page.
Option Description
Search For To search for an existing HTML parser rule, type the rule name in the Search for text box. WorldServer automatically jumps to the first rule corresponding to the search criteria.
Use the Previous and the Next arrows to navigate through the rules containing the search keyword.
If you use the same name (but specify different settings) for multiple rules, WorldServer prioritizes the most general one, with the least conditions assigned. You can use the Prioritize and the De-prioritize arrows to change the order that WorldServer considers when applying parse rules with the same name.
Import Click Import... to import parser rules settings that were created for different files. You can also import rules when creating a new HTML file type in the Create File Type wizard.
Note: This will not import any other settings available in *.ini and .*sdlftsettings files.
Remove Select a parser rule and click this to remove the rule from the list. WorldServer will no longer consider it when extracting HTML content.
Don't inherit from parent Enable this option if you want to display the context information as a root element in the Document Structure tree.