Markup, XML, TEI

Character, structure, guidelines

Content

Procedural Markup

  • Visual, typographic, representation;
  • WYSWYG: What You See is ALL you Get.

  • Descriptive Markup

  • Semantic, funcional, structured;
  • Meaning before format.
  • "Miguel de Cervantes wrote Don Quixote"

    Procedural

    Miguel de Cervantes wrote Don Quixote

    Descriptive

     MarkupMiguel de Cervantes wrote Don Quixote
     Visualizationname {font-weight: bold;} title {font-style: italic;} 
     Process<xsl:for-each select="//name">

    XML Golden Rules

     An element consists of a start-tag, optional content, and an end-tag. Miguel de Cervantes
     An element may have attributes, each consisting of a name and a value. Miguel de Cervantes
     An XML document may contain processing instructions or comments. <? xml-stylesheet type="text/css" href="visualize-tei-xml.css"?>
         
     Elements and attributes are case sensitive. <name> ≠ <Name>
     Empty elements can use a minimized tag syntax. <pb/> =   

    Well-formed (good syntax)

    • XML-Version, character set;
    • A single root element (first node) contains the XML document;
    • Each element is nested within the root element in a hierarchical structure: no overlapping;
    • Elements, attributes, etc., are case sensitive;
    • Start-tags and end-tags are mandatory;
    • Attribute values are quoted.

    Valid (against the schema)

    • Well-formed;
    • Follows a structural definition (schema);
    • Which elements and attributes are used;
    • Which elements and attributes can appear where;
    • Names, datatypes, and default values for all attributes.
    • Development of markup for texts;
    • International standard;
    • Interchange of data;
    • Platform independence;
    • Preservation;
    • Research community.
    • TEI P5: Version 3.5.0. Last updated on 29th January 2019

    Modular system

    Each module defines a group of elements and attributes

    TEI basic structure

    TEI basic structure

    <teiHeader>

    TEI basic structure

    <teiHeader>

    Content in the <teiHeader>

    <teiHeader>
      <fileDesc>
    	<titleStmt>
    		<title>EXERCISE 01 · Encodage de base</title>
    		<author>José Luis Losada</author>
    	 </titleStmt>
    	<publicationStmt>
    		<publisher></publisher>
    		<date when="2019-04-25">25/04/2019</date>
    	 </publicationStmt>
    	<sourceDesc>
    		<bibl><title>La perle de Séville</title></bibl>
    	 </sourceDesc>
      </fileDesc>
    </teiHeader>

    TEI basic structure

    <text>

    TEI basic structure

    <text>

    TEI basic structure

    <text>

    Content in the <text>

    
        <body>
          
    <head>AVERTISSEMENT</head>

    Je demande pardon aux lecteurs français d'avoir choisi, parmi le nombreuses pièces de Lope de Vega, celle-ci, où le précepte des trois unités semble avoir été plus scandaleusement violé

    </body>

    Classes of elements

  • grouped semantically: names, places, titles, ...
  • grouped structurally: divisions, paragraphs, verses,...
  • Verse and name annotation

    <body>
    
      <persName>Doris</persName>, qui sait qu'aux vers quelquefois je me plais,
      Me demande un sonnet, et je m'en désespère:
      Quatorze vers, grand Dieu! Le moyen de les faire?
      En voilà cependant déjà quatre de faits.
    
    </body>
    

    Atributtes inside of elements

     Provides a unique identifier for any element:  @xml:id 
     Provides a name or number for any element:  @n 
     Specifies the language of any element:  @xml:lang 
     Characterizes the element in some sense:  @type 
     Specifies the visual appearance (rendition):  @rend, @style, @rendition 

    Atributtes and elements

    <body>
       
          Doris, qui ſait qu'aux vers quelquefois je me plais,
          Me demande un Sonnet; & je m'en deſeſpere:
          Quatorze vers, grand Dieu! le moyen de les faire!
          En voilà cependant desja quatre de faits.
      
    </body>
    

    Metatextual markup for editions

    Enriched with corrections, regularizations, expansions
     Groups alternative encodings:    
     Apparent error:  
     Corrected error:   
     Original form:   
     Regularized form:   

    Markup for corrections

    <body>
    
          Doris, qui ſait qu'aux vers quelquefois je me plais,
          Me demande un Sonnet; & je m'en deſeſpere:
          Quatorze vers, grand Dieu! le moyen de les faire!
          En voilà cependant 
    		
    			desja
    			déjà
    		 
                quatre de faits.
    
    </body>

    Critical Apparatus, Dictionaries, Manuscript Description, Bibliographies, ...

    nearly 500 elements

    Guidelines

    Information, eg., on an element

    Quick look up in the TEI Guidelines.

    TEI-Look up

    TEI Roma (tool)

    Generating customizations for the TEI
    • possibility to choose your modules;
    • possibility to create new elements;
    • possibility to generate your schema;
    • possibility to generate your documentation.

    References

    José Luis Losada Palenzuela, Markup, XML, TEI. Character, structure, guidelines, <http://editio.github.io/presentations/xml-tei>, 2019.

    License Creative Commons