TetraSix

Majix Light 1.1

MaJix

  • What's new ?
  • TetraSix Software License Agreement for Majix
  • Downloading Majix
  • Obtaining a Java VM
  • Running Majix
  • Majix process
  • Majix user interface
  • Majix input format
  • Majix intermediate format
  • Majix output format
  • Modify input format
  • Modify XML tag names
  • Browsing XML documents
  • The configuration file
  • The tools
  • The sample
  • Comments
  • Majix output format

    Majix convert RTF files into XML files.

    The prologue of the generated XML document is:

    <?xml version="1.0"?>

    <!DOCTYPE mydoc PUBLIC "-//TetraSix//DTD mydoc v1.1//EN" "mydoc.dtd" [

    ]>

    If the document contains pictures, the converter generates an entity set declaration with a notation and one entity declaration for each picture, like this (assuming there are three pictures):

    <!NOTATION WMF PUBLIC "-/TetraSix/NOTATION Windows Metafile/EN">

    <!ENTITY pict001 SYSTEM "pict001.wmf" NDATA WMF>

    <!ENTITY pict002 SYSTEM "pict002.wmf" NDATA WMF>

    <!ENTITY pict003 SYSTEM "pict003.wmf" NDATA WMF>

    Remark: this version of Majix does not extract the picture data from the RTF file.

    Majix produces by default XML documents conforming to the following DTD element set:

    <!ENTITY % section-content "p|list|table|ex|dl|note|attention">

    <!ENTITY % inline-text "b|i|u|c|v|emph|strong|prodname|jargon|tm|exemple|kw">

    <!ENTITY % inline-element "%inline-text;|graphic|link">

    <!ENTITY % id "id ID #IMPLIED">

    <!ELEMENT mydoc (info?, title?, (p|list|table|ex|dl|toc)*, (h1|h2)*)>

    <!ELEMENT info (title?, subject?, author?, operator?, manager?, company?)>

    <!ELEMENT subject (#PCDATA)>

    <!ELEMENT author (#PCDATA)>

    <!ELEMENT operator (#PCDATA)>

    <!ELEMENT manager (#PCDATA)>

    <!ELEMENT company (#PCDATA)>

    <!ELEMENT title (#PCDATA|i|u|c)*>

    <!ATTLIST title %id;>

    <!ELEMENT h1 (ht, (%section-content;)*, h2*)>

    <!ATTLIST h1 %id;>

    <!ELEMENT h2 (ht, (%section-content;)*, h3*)>

    <!ATTLIST h2 %id;>

    <!ELEMENT h3 (ht, (%section-content;)*, h4*)>

    <!ATTLIST h3 %id;>

    <!ELEMENT h4 (ht, (%section-content;)*, h5*)>

    <!ATTLIST h4 %id;>

    <!ELEMENT h5 (ht, (%section-content;)*, h6*)>

    <!ATTLIST h5 %id;>

    <!ELEMENT h6 (ht, (%section-content;)*)>

    <!ATTLIST h6 %id;>

    <!ELEMENT ht (#PCDATA|%inline-text;)*>

    <!ATTLIST ht %id;>

    <!ELEMENT p (#PCDATA|%inline-element;)*>

    <!ATTLIST p %id;>

    <!ELEMENT note (#PCDATA|%inline-element;)*>

    <!ATTLIST note %id;>

    <!ELEMENT attention (#PCDATA|%inline-element;)*>

    <!ATTLIST attention %id;>

    <!ELEMENT b (#PCDATA|%inline-element;)*>

    <!ELEMENT i (#PCDATA|%inline-element;)*>

    <!ELEMENT u (#PCDATA|%inline-element;)*>

    <!ELEMENT v (#PCDATA|%inline-element;)*>

    <!ELEMENT c (#PCDATA|%inline-element;)*>

    <!ATTLIST c color

    CDATA #IMPLIED>

    <!ELEMENT emph (#PCDATA|%inline-element;)*>

    <!ELEMENT strong (#PCDATA|%inline-element;)*>

    <!ELEMENT prodname (#PCDATA|%inline-element;)*>

    <!ELEMENT jargon (#PCDATA|%inline-element;)*>

    <!ELEMENT tm (#PCDATA|%inline-element;)*>

    <!ELEMENT exemple (#PCDATA|%inline-element;)*>

    <!ELEMENT kw (#PCDATA|%inline-element;)*>

    <!ELEMENT list (item|list)+>

    <!ATTLIST list style (bullet|numeric) #IMPLIED

    %id;>

    <!ELEMENT item (p*)>

    <!ATTLIST item %id;>

    <!ELEMENT graphic EMPTY>

    <!ATTLIST graphic url CDATA #IMPLIED

    %id;>

    <!ELEMENT table (row+)>

    <!ATTLIST table %id;>

    <!ELEMENT row (cell+)>

    <!ATTLIST row %id;>

    <!ELEMENT cell (p|list|ex)*>

    <!ATTLIST cell %id;

    width CDATA #IMPLIED>

    <!ELEMENT ex (xl*)>

    <!ATTLIST ex %id;>

    <!ELEMENT xl (#PCDATA|b|i|u|c|graphic|link)*>

    <!ELEMENT toc EMPTY>

    <!ELEMENT link (#PCDATA|b|i|u|c|graphic)*>

    <!ATTLIST link

    url CDATA #IMPLIED

    refid CDATA #IMPLIED>

    <!ELEMENT dl ((dt,dd)*)>

    <!ATTLIST dl %id;>

    <!ELEMENT dt (#PCDATA|%inline-text;)*>

    <!ATTLIST dt %id;>

    <!ELEMENT dd (#PCDATA|%inline-text;)*>

    <!ATTLIST dd %id;>

    <!ELEMENT toc EMPTY>

    <!ENTITY undef "">

    where the use of the various elements is the following:

    attention
    a warning
    author
    author of the document
    b
    text in bold (occurs in paragraphs)
    c
    text in color (occurs in paragraphs)
    cell
    a table cell
    company
    the company's name
    emph
    emphasis
    ex
    code exemple
    exemple
    code exemple (in-line)
    graphic
    an embedded picture
    h1
    a section of level 1
    h2
    a section of level 2
    h3
    a section of level 3
    h4
    a section of level 4
    h5
    a section of level 5
    h6
    a section of level 6
    ht
    the title of a section
    i
    text in italic (occurs in paragraphs)
    info
    a bloc of information concerning the document
    item
    item of a list
    jargon
    technical term
    list
    list of items
    kw
    keyword
    manager
    the manager's name
    mydoc
    the top-level element
    note
    an inline note
    operator
    the typist's name
    p
    a text paragraph
    prodname
    name of a product
    row
    a table row
    strong
    another kind of emphasis
    subject
    the subject of the document
    table
    a table
    title
    the title of the document
    tm
    trade mark
    u
    text in underline (occurs in paragraphs)
    v
    hidden text (occurs in paragraphs)
    xl
    code exemple line

    It is however possible to change the names of the tags. Of course, if you do so you will also need to change the DTD.


    Copyright TetraSix, 1999 - info@tetrasix.com