This document describes a draft encoding model for Genetic Editions and Genetic Editing. The document is the product of a Workgroup on Genetic Editions (chair: Fotis Jannidis), which is part of the TEI MS SIG (chairs: Elena Pierazzo, Malte Rehbein, Amanda Galley).
The workgroup's goal was to develop an Application Profile for the encoding of genetic editions and, in general, genetic phenomena. It is expressed as a TEI P5 conformant customization, integrating material from the existing TEI Guidelines, chiefly Chapter 11. Representation of Primary Sources and Chapter 12. Critical Apparatus, together with additional new material. It may eventually, at the end of the process described in the following section, constitute a self standing new Guidelines chapter, or remain a set of recommendations for how to customize the Guidelines, but that is a decision for the TEI Council.
The document reflects discussions held at a number of different meetings:
The work group was initially inspired by the HNML. HyperNietzsche Markup Language and following versions (GML Genetic Markup Language) produced by Paolo D'Iorio and colleagues from the HyperNietzsche project. We would like to thank Paolo D'Iorio for his invaluable contribution in the early stages of the work.
This version of the document has been extensively revised by Elena Pierazzo and Lou Burnard, for presentation at a panel to be held at the Annual TEI Members Meeting in November 2009.
The planned evolution of this document and the encoding model it describes may be summarized as follows:
This draft document is publicly available for discussion and feedback from the community. The document source is maintained in the TEI subversion repository at http://tei.svn.sourceforge.net/viewvc/tei/trunk/genetic/ ; information about the development of the proposals and associated materials, including complete drafts of this document, are hosted on the TEI Wiki at http://wiki.tei-c.org/index.php/Category:Genetic_Editions .
Although the entire document is a draft and therefore susceptible of changes, some sections are less stable than others. In particular, when a section or a particular element requires further discussion or is considered an open problem, such a section or element is marked by a * mark.
As required for TEI conformance, non-TEI elements are defined in a distinct non-TEI namespace. In the usage examples and throughout this document that namespace is mapped to the prefix ge:, while TEI elements are not marked by any namespace prefix.
The genetic approach differs from other approaches to the study of texts because it aims not only to identify ‘what is on the page’, but also to reconstruct the process necessary to produce ‘what is on the page’.
Because our model aims to be independent of presuppositions associated with any particular theoretical framework, we begin by reviewing some typical dichotomies in editorial theory.
In German editorial theory there is a well known opposition between what is there in the source document, the record (Befund), and the interpretation of this phenomenon (Deutung). This opposition implies that there is a way to talk about the record without any interpretation. Yet at some possibly simplistic level, everything we say about a text is based on interpretation, particularly in the realm of genetic criticism. 1 At the same time, there is an obvious difference between the interpretation that some trace of ink is indeed a specific letter and the assumption that a change in one line of a manuscript must have been made at the same time as a change in another line because their effects are textually related (for example, the first change was to a rhyming word, which necessitated the second change). Therefore we propose to talk about differing levels of interpretation, thus differentiating between ‘what’s there’ (document/fact) and ‘how does it relate’ (text/interpretation).
In Manuscript Studies (Editing, Codicology, Palaeography, Art History, History) the first level of enquiry is always the document, the physical support that lies in front of the scholar’s eyes.
To understand the text that is contained in the manuscript, a deep study of the manuscript itself is fundamental: the layout, the type of script, the type of writing support, the binding and many other aspects are able to tell us about when, where, and why this particular text was composed. The text therefore represents a different level of enquiry: it is a construct, derived from the reading of the documents.
In the case of modern draft manuscripts scholars must give detailed consideration to the layout, the different stratifications of writing and the disposition of these in the physical space; all of these, together with an understanding of the text, are required to gain insight about the composition, time of revisions, and flow (flux) of the text. Furthermore, in some cases, we know that the kind of physical support used to record it not only influences but may also actually determine the text itself. For instance, the content and the length of letters are often determined by the size and quantity of the paper available to the writer; even more so for items such as postcards.
The TEI has traditionally prioritised the text level. Of the two possible views available to someone transcribing a primary source (text and document), the TEI privileges the text (hence Text Encoding Initiative). Such physical or topographical information as a typical TEI encoding provides is subordinate to the main structural encoding, whether because it is represented by empty elements (<pb/>, <lb/>, <cb/>) or attributes (<add place="">, <note place="">, or rend). The TEI thus reflects the not uncommon view that, while relevant, documents are somehow less relevant than the texts they embody; to use a bibliographical metaphor, texts are ‘substantial’ while documents are ‘accidental’.
However, for genetic editions a focus on the document is crucial. In many cases, the only way to reconstruct the process of writing and re-writing which leads to a new text is to examine a specific document. We therefore propose to complement the existing text-focussed approach with a new encoding scheme focussed instead on the document.
We should then clarify the way we will use the following words:
Modern genetic editions encode the genetic process within one manuscript and over the course of two or more manuscripts; in this latter case quite often they also offer a view of each of the manuscripts as a single self-contained object. This is because the manuscript view provides the material basis for the relationships established by the inter-manuscript relationship. Therefore we propose to differentiate between the following aspects of a genetic edition:
| type | characterizes the element in some sense, using any convenient classification scheme or typology. |
| rotate | indicates the amount by which this zone has been rotated clockwise, with respect to the normal orientation of the parent surface element as implied by the dimensions given in the msDesc section or by the coordinates of the surface itself. The orientation is expressed in arc degrees. |
| stage | points to a <stageNote> which contains a description of a text-stage to which the editors think the alteration marked by the element bearing this attribute (and its children) belongs. |
Like a facsimile, a <ge:document> contains information about the written surfaces constituting a document. Because of this similarity, we would like to use the same elements (surface and zone) as proposed in the existing TEI scheme, although these place limits on what can be described. Specifically, the zone element as currently defined can represent only a rectangular area; it also lacks any way of stating the baseline applicable to any writing contained within it
The size of the writing surface is defined by a set of cartesian coordinates measured from the top left corner. The co-ordinates of all zones identified within the writing surface are given in terms of the same co-ordinates, as further discussed in the TEI proposals for facsimile. It will often be the case that explicit dimensions for a manuscript page (expressed in mm for example) are also supplied in a msDesc element in the TEI Header, but this is not a requirement; in particular there is no assumption that the co-ordinate system defined by a surface maps to any particular external dimensions, nor that the co-ordinate systems of different documents necessarily correspond.
A surface element may contain any number of zone graphic or line elements. The graphic element is used to point to any graphic (non textual) component forming part of the page, in the usual TEI manner. The zone element is used to delimit any contiguous section of writing which the encoder wishes to identify for some purpose.
Zones can be nested and grouped, and can also overlap. Their positioning with respect to the surface element is defined by coordinate values taken from the same co-ordinate system as the surface itself, measured from the top left corner. The element carries a rotate attribute which describes (in degrees) the orientation of the surface with respect to the content (writing, images) in that zone, with respect to its normal orientation. Note that the mechanism aims to describes the process by which the content of a specific zone has been supplied (i.e. the author has physically rotated the writing surface) rather than the orientation of the writing.
Zones are arbitrarily defined by the encoder according to the layout of the writing surface and can make use of a standardised vocabulary (e.g. the top margin).
To overcome the inherent limitations of using the existing zone and surface elements, we propose to extend their capability to include the definition of arbitrary polygons and baselines, probably by embedding appropriate elements from the Standard Vector Graphics (SVG) XML namespace. This work is not yet complete however.
The attribute stage is used to indicate the stage in a writing campaign to which this zone has been assigned by the encoder, as further discussed in 3.3 Revision campaigns below.
Within a zone, individual lines of writing are usually distinguished using the <ge:line> element.
Is it possible to combine both perspectives within a single encoding? In general a document-based transcription, which is done page-by-page and possibly line-by-line, is almost certain to overlap with some part of a the text-based structure. The cleanest solution may be to encode both structures separately, providing both a document and a distinct text solution, perhaps using some form of external pointing to link the two, and minimizing redundancy of encoding by using XInclude. This option is further discussed below and also in the TEI Guidelines.
| binder | Describe the method by which a patch is or was connected to the main surface |
| type | characterizes the element in some sense, using any convenient classification scheme or typology. |
| height | height of the patch in mm |
| width | width of the patch in mm |
Traces of authorial alteration (correction, addition, deletion, etc.) are frequently found within a single document, and may also be inferred when different documents are compared. It is however an open question as to whether inter-document discrepancies at the dossier level should be regarded in the same way as intra-document alterations. If two witnesses are collated, we may observe that a word present in one is missing from the other: does it necessarily follow that this is an addition or a deletion, which we would not hesitate to mark with an add or del tag if we are transcribing a single manuscript? We return to this question below.
| function | describes the function (e.g. add, delete, alternate) of the mark. |
| targets | indicates the element(s) to which the function of the meta-mark refers. Pointers are separated by a white space |
| cause | documents the presumed cause of the rewriting. |
| spanTo | indicates the end of a span initiated by the element bearing this attribute. |
| spanTo | indicates the end of a span initiated by the element bearing this attribute. |
A writer may sometimes rewrite material a second time without significant change and in the same place. We consider this a distinct activity from addition as usually defined because no new textual material results but the status of existing material changes. We distinguish two variants of this: fixation where the first version was a tentative draft which is subsequently fixed, for example by inking it over; and clarification, where the first version was badly written and has been rewritten for clarity. The element <ge:rewrite> is provided to cover both cases.
In general deletion in a source is marked using the del or delSpan element. However, it is useful to distinguish cases where a passage has been ‘indicated as superfluous or spurious in the copy text by an author, scribe, annotator, or corrector’ (TEI P5, s.v. del) from cases where a passage has been struck through or otherwise marked as having been used or copied to another location. In this latter case, the author does not intend to suppress the content, but only to mark that it has been transferred or reused. The element <ge:used> is provided to mark this kind of ‘deletion’.
By metamark we mean marks such as numbers, arrows, crosses, or other symbols introduced by the writer into a document expressly for the purpose of indicating how the text is to be read. Such marks thus constitute a kind of markup of the document, rather than forming part of the text.
Unlike marginal notes or other additions to the text, meta-marks indicate a deliberate alteration of the writing (e.g. ‘move this passage over there’). We also consider as metamarks dates introduced to mark the beginning of a manuscript or a revision, but not forming part of it.
The <ge:metaMark> element carries a function attribute which specifies the function of the meta-mark and a targets attribute which points to the element or elements concerned.
At regular points throughout the various drafts of the work, a number occurs, usually in the right margin (in this instance, "100"). These numbers result from the author counting the number of verse lines he has composed to the given point, and are not part of the text, but represent a stage at which Moore is taking stock of the progress of his composition.
Metamarks are commonly used in the context of transposition, that is, the moving of words or blocks by the author to a different position using arrows, asterisks or numbers or other metamarks. One possible approach (used, for instance in HNML) would be to regard such transpositions as a special kind of substitution, and actually to represent the result of the transposition indicated by the metamarks in the encoding, for example by considering the segment previous to the transposition as deleted, and substituted by the one after the transposition.
One or more transposeGrp elements may be supplied either embedded within the text or in the profileDesc of the header, depending on local preference. Each transposeGrp can contain one or more transpose element, each of which defines a single transposition.
In some cases an author indicates that an alteration is itself to be altered: for example, a struck through passage may be restored via a dotted underlining, or the underlining of a passage may be deleted by a wavy line.
This has obvious similarities to the existing revisionDesc element, but concerns the source document or set of documents rather than the TEI document representing them. We therefore propose using the existing change element for the purpose of documenting individual text stages. The existing element creation (within the TEI Header profile description) is defined as the appropriate location for all information relating to the genesis or production of a text; we might therefore modify it slightly to permit a new stageHist element which contains a number of change elements, one for each identified stage. This would also be closely analogous to the existing recordHist element, which documents changes in the catalogue record related to an artefact, as well as the revisionDesc which documents changes in the digital artefact itself.
The order of change elements within the stageHist will normally be given from the earliest to latest, where this is known. The existing change element carries a number of attributes from the att.datable class (period, when, notBefore, notAfter, from, and to) which allow each stage to be dated as exactly or inexactly as necessary, in the same way as is currently possible for the TEI date element.
Typically, each change element will contain references to other annotations contained within the teiHeader or in the document, but its contents are purely documentary.
The targets of the various pointers (transp-1, insertion-1 etc.) in the above example may be any part of the transcribed document which has been marked up and allocated an identifier, such as the pages or insertion points mentioned above. The former will presumably be marked as <ge:surface> elements or zone elements, while the latter may be marked using the generic TEI anchor element.
Alternatively, or in addition, we propose a generic state attribute which can point in the opposite direction, and associate any sequence of mark-up in a document with a change element, thus allocating that particular writing event to a particular revision campaign.
Because a typical revision campaign will comprise very many individual modifications (possibly hundreds) an element called mod (for modification) is proposed as a means of delimiting the scope of all modifications to be assigned to a given change. This is a milestone-like (empty) element, placed at the start of the text affected, and indicating the end of that range by means of an spanTo attribute.
In a case like this, there is no particular assertion about the order in which any of the various modifications making up this revision campaign were effected. If such detailed analysis is required, the existing seq attribute may be used to supply a sequence number. For example, if there are two additions within a given stage, and it is clear that one precedes the other, this could be indicated by giving the earlier one a seq attribute with the value 1 and the later one a seq attribute with the value 2.
The use of tags such as del and add necessarily implies that the modification concerned was made at some time after the original writing. An exception to this is where a false start or ‘instant’ correction has been identified: the author starts to write, and then immediately corrects what has been written. A special mechanism is provided for this case: the seq attribute may take the value 0 to indicate that the addition or deletion is considered to belong to the same writing stage as the rest of the unmodified document.
The term dossier is used to refer to the set of documents which a genetic editor considers as having contributed to the evolution of a particular text. These may include drafts, revisions, or documents related in other ways. Since each such document will most probably be encoded as a distinct TEI document with its own TEI Header, the natural way to encode a dossier would be to use the existing teiCorpus element. This would provide a TEI Header in which metadata regarding the organization of the dossier itself can be recorded, independently of the metadata regarding each particular document contained within it, which would be held in a discrete TEI Header attached to that particular encoded document.
Looking at the documents which constitute a given dossier, there are many types of relationships which can be identified, both amongst complete documents, and amongst parts of those documents, including even alterations, revisions and other compositional phenomena. A further complexity arises if for example an author chooses to correct two different versions at the same time, 3 . We may thus need to express that two or more documents are related in different ways; for instance, one document may be the sequel of another, one may have been drafted at the same time as another, one may contain material or treat topics related to those of another, for example a newspaper article may inspire or be quoted by a given work.
Here the standard TEI link element has been used to point to the documents which are related in some way. The type attribute could be used to distinguish nuances of relationship.
By ‘genetic relations’ we mean the ordering of the different text-stages represented, either within a single document or more probably, across different documents, into a hypothetical line of development, going, for instance from a version A to a version B (that can be represented by a different document or by an editorially reconstructed text-stage), and then to a version C, etc.
While a <ge:geneticNote> simply describes what a group of documents have in common, a genetic relation will tries to organise them into an idealised genetic or evolutionary line. The TEI offers a number of generic methods for representing such structures in the P5 chapter on graphs networks and trees, from which we adopt the idea of representing genetic relations as directed acyclic graphs
A graph is a structure composed of many nodes and arcs. Each node represents one document, document component, or revision stage (as defined in 3.3 Revision campaigns above), and each arc represents a connexion of some kind between two nodes. Arcs may be typed to distinguish different kinds of relationship. For our purposes, the graph is directed, because we wish to represent a particular path through it, and acyclic, because a given node can appear at one point in the graph, although it may of course be used by many other different nodes. The graph defined by a genetic relation resembles a family tree, in that there is a single terminal node, representing the final state, with many preceding nodes linking to it, either directly or via other nodes. 6
The number of possible graphs that might be drawn for a given dossier is not of course limited in any way: the encoder may derive as many as they wish. They may also wish to represent other forms of syntagmatic structure, for narratalogical or other purposes, using essentially the same mechanism.
| from | gives the identifier of the node which is adjacent from this arc. |
| to | gives the identifier of the node which is adjacent to this arc. |
| value | provides the value of a node, which is a feature structure or other analytic element. |
| adjTo | (adjacent to) gives the identifiers of the nodes which are adjacent to the current node. |
| adjFrom | (adjacent from) gives the identifiers of the nodes which are adjacent from the current node. |
As noted above, not all kinds of variation within and between documents are equivalent. For example, most people would regard authorial modifications within a single draft or between subsequent drafts as having a different significance from modifications assigned to scribal variation within a long textual tradition, despite their formal similarities.
When a passage has been visibly deleted in one version of a text we will generally mark it explicitly; if however a passage present in one version (A) is omitted in another (B), it may be a matter of uncertainty as to whether it has been deleted from B, or added to A. Even if this is certain (perhaps because the order of the two versions is known), the omission from B of material in A is not entirely the same phenomenon as an explicit deletion.
The addition (or deletion) of a segment from a version is normally a deliberate act of the author and we would like to be able to record that in positive way; whether we need another set of editorial elements or we should use the same set that are used for transcription remains an open question.
The chronology (timing) of parts of a document or documents may be expressed in absolute terms (at such a time on such a date), or relatively (before or after some other event). Relative time can also be expressed by the relation to the (known or unknown) creation of another document or text. Dating can be justified by prose and/or by reference to a characteristic of a manuscript (e.g. hand, ink, etc.). The outlining of a chronology for a document or dossier can then be used as an argument to determine the existence of a text-stage.
The TEI provides a timeline element which can be used to define a scale, a co-ordinate system for measuring time. This enables us to align other components of a document with particular points in time, each such point being represented by a when element. The temporal inter-relation of when elements is expressed by attributes stating, for example, that this point in time is so many hours or years after another, or absolutely using a standardized notation for date and time (see further discussion in the Guidelines). The alignment is done using an attribute such as sync, to state that a given part of the text is aligned with a given point in time.
This mechanism was developed originally for the representation of transcribed speech, in particular to support overlap and discontinuity at a fairly fine-grained level. It is not clear to what extent it can be generalised to support the comparatively coarse grained and imprecise notions which typify analysis of textual genesis. In particular, the need to express alternative and uncertain temporal sequences remains problematic.
One option might be to define a specific element such as <ge:evolution> comprising a series of pointers to change elements, organized in such a way as to express alternative or varyingly certain views about their sequence, possibly using the existing TEI alt element, and existing featurs for indicating degrees of uncertainty. Such pointers could also be synchronized with an external timeline using the existing TEI mechanisms if this was thought useful. The Workgroup has not completed work on elaborating these proposals however.
Genetic editing is an essentially interpretative process; documentation of all editorial decisions is conseqently of major importance.
Annotations can also occur in-line (i.e. close to a textual fragment they relate to) and in many other places; the existing note element should be used to record these.
TEI Extension for Genetic Editions -- preliminary version
| AnyThing Matches any element | |
| Module | derived-module-geneticTEI |
| Used by | |
| Declaration |
AnyThing =
(
element * { attribute * - (xml:id | xml:lang) { text }*, AnyThing }
| text
)*
|
| att.staged groups elements which can be assigned to a specific text stage by means of the attributes it provides. | |||||||||
| Module | tei | ||||||||
| Members | att.transcriptional [add addSpan del delSpan restore rewrite subst] line metaMark mod undo used zone | ||||||||
| Attributes | In addition to global attributes
|
||||||||
| <document> contains a document-centric transcription of a primary source, providing topographical information as well as transcription | |
| Module | derived-module-geneticTEI |
| In addition to global attributes | att.global (@xml:id, @n, @xml:lang, @rend, @rendition, @xml:base) (att.global.linking (@corresp, @synch, @sameAs, @copyOf, @next, @prev, @exclude, @select)) (att.global.analytic (@ana)) (att.global.facs (@facs)) |
| Used by | |
| May contain |
transcr:
surface
|
| Declaration |
element document { att.global.attributes, surface+ }
|
| <fallback> Wrapper for fallback elements if an XInclude fails | |
| Module | derived-module-geneticTEI |
| Used by | |
| May contain | Empty element |
| Declaration |
element fallback { AnyThing }
|
| <geneticGrp> Group texts and document which are somehow related in a genetic process | |
| Module | derived-module-geneticTEI |
| In addition to global attributes | att.global (@xml:id, @n, @xml:lang, @rend, @rendition, @xml:base) (att.global.linking (@corresp, @synch, @sameAs, @copyOf, @next, @prev, @exclude, @select)) (att.global.analytic (@ana)) (att.global.facs (@facs)) |
| Used by | |
| May contain |
derived-module-geneticTEI:
geneticNote
|
| Declaration |
element geneticGrp { att.global.attributes, geneticNote+ }
|
| <geneticNote> describes a particular set of documents or document fragments which are considered to be mutually associated in some way. | |
| Module | derived-module-geneticTEI |
| In addition to global attributes | att.typed (@type, @subtype) att.editLike (@evidence, @source) (att.dimensions (@unit, @quantity, @extent, @precision, @scope) (att.ranging (@atLeast, @atMost, @min, @max)) ) (att.responsibility (@cert, @resp)) att.global (@xml:id, @n, @xml:lang, @rend, @rendition, @xml:base) (att.global.linking (@corresp, @synch, @sameAs, @copyOf, @next, @prev, @exclude, @select)) (att.global.analytic (@ana)) (att.global.facs (@facs)) |
| Used by | |
| May contain | |
| Declaration |
element
geneticNote
{
att.typed.attributes,
att.editLike.attributes,
att.global.attributes,
linkGrp+,
model.pLike+
}
|
| <include> The W3C XInclude element | |||||||||||||||||||||||||||||||||||||||||||
| Module | derived-module-geneticTEI | ||||||||||||||||||||||||||||||||||||||||||
| In addition to global attributes | In addition to global attributes
|
||||||||||||||||||||||||||||||||||||||||||
| Used by | |||||||||||||||||||||||||||||||||||||||||||
| May contain |
derived-module-geneticTEI:
fallback
|
||||||||||||||||||||||||||||||||||||||||||
| Declaration |
element
include
{
attribute href { xsd:anyURI }?,
attribute parse { "xml" | "text" }?,
attribute xpointer { text }?,
attribute encoding { text }?,
attribute accept { text }?,
attribute accept-charset { text }?,
attribute accept-language { text }?,
fallback?
}
|
||||||||||||||||||||||||||||||||||||||||||
| <line> contains the transcription of a topographic line in the source document | |
| Module | derived-module-geneticTEI |
| In addition to global attributes | att.staged (@stage) att.global (@xml:id, @n, @xml:lang, @rend, @rendition, @xml:base) (att.global.linking (@corresp, @synch, @sameAs, @copyOf, @next, @prev, @exclude, @select)) (att.global.analytic (@ana)) (att.global.facs (@facs)) |
| Used by | |
| May contain | |
| Declaration |
element
line
{
att.staged.attributes,
att.global.attributes,
(
text
| model.global
| model.pPart.transcriptional
| model.pPart.editorial
| model.segLike
| model.gLike
| model.hiLike
)*
}
|
| <metaMark> (meta mark) A textual or graphical element in a manuscript that is functional but not part of the text. Could transform the text, like a strikethrough, or provide meta-information, like a date. | |||||||||||||
| Module | derived-module-geneticTEI | ||||||||||||
| In addition to global attributes |
att.spanning (@spanTo) att.placement (@place) att.staged (@stage) att.global (@xml:id, @n, @xml:lang, @rend, @rendition, @xml:base) (att.global.linking (@corresp, @synch, @sameAs, @copyOf, @next, @prev, @exclude, @select)) (att.global.analytic (@ana)) (att.global.facs (@facs))
|
||||||||||||
| Used by | |||||||||||||
| May contain |
core:
abbr
add
address
bibl
cb
choice
cit
corr
date
del
desc
distinct
email
emph
expan
foreign
gap
gloss
graphic
hi
index
l
label
lb
lg
list
listBibl
measure
measureGrp
mentioned
milestone
name
note
num
orig
p
pb
ptr
q
quote
ref
reg
rs
said
sic
soCalled
sp
stage
term
time
title
unclear
gaiji:
g
header:
biblFull
msdescription:
catchwords
depth
dim
dimensions
height
heraldry
locus
locusGrp
material
msDesc
origDate
origPlace
secFol
signatures
stamp
watermark
width
namesdates:
affiliation
country
listEvent
listNym
persName
placeName
settlement
textstructure:
floatingText
|
||||||||||||
| Declaration |
element
metaMark
{
att.spanning.attributes,
att.placement.attributes,
att.staged.attributes,
att.global.attributes,
attribute function { token { pattern = "(\p{L}|\p{N}|\p{P}|\p{S})+" } }?,
attribute targets { list { xsd:anyURI+ } }?,
macro.specialPara
}
|
||||||||||||
| <mod> defines the scope of an area in the document containing several alterations which are considered as belonging to the same revision campaign. | |
| Module | derived-module-geneticTEI |
| In addition to global attributes | att.spanning (@spanTo) att.staged (@stage) att.global (@xml:id, @n, @xml:lang, @rend, @rendition, @xml:base) (att.global.linking (@corresp, @synch, @sameAs, @copyOf, @next, @prev, @exclude, @select)) (att.global.analytic (@ana)) (att.global.facs (@facs)) |
| Used by | |
| May contain | Empty element |
| Declaration |
element
mod
{
att.spanning.attributes,
att.staged.attributes,
att.global.attributes,
empty
}
|
| model.zonePart elements which can form part of a zone | |
| Module | derived-module-geneticTEI |
| Used by | |
| Members | line zone |
| <patch> contains a part of a written surface which was originally physically distinct but became attached to it at the time that one or more written zones were created on it. | |||||||||||||||||||||||||||
| Module | derived-module-geneticTEI | ||||||||||||||||||||||||||
| In addition to global attributes |
att.coordinated (@start, @ulx, @uly, @lrx, @lry) att.global (@xml:id, @n, @xml:lang, @rend, @rendition, @xml:base) (att.global.linking (@corresp, @synch, @sameAs, @copyOf, @next, @prev, @exclude, @select)) (att.global.analytic (@ana)) (att.global.facs (@facs)) att.typed (@type, @subtype)
|
||||||||||||||||||||||||||
| Used by | |||||||||||||||||||||||||||
| May contain | |||||||||||||||||||||||||||
| Declaration |
element
patch
{
att.coordinated.attributes,
att.global.attributes,
att.typed.attributes,
attribute binder { xsd:Name }?,
attribute flipping { xsd:boolean }?,
attribute
height
{
xsd:double | token { pattern = "(\-?[\d]+/\-?[\d]+)" } | xsd:decimal
}?,
attribute
width
{
xsd:double | token { pattern = "(\-?[\d]+/\-?[\d]+)" } | xsd:decimal
}?,
( text | zone | model.global )*
}
|
||||||||||||||||||||||||||
| <rewrite> contains a sequence of text which has been rewritten by the author, for example by over-inking, to clarify or fix it. | |||||||||
| Module | derived-module-geneticTEI | ||||||||
| In addition to global attributes |
att.spanning (@spanTo) att.global (@xml:id, @n, @xml:lang, @rend, @rendition, @xml:base) (att.global.linking (@corresp, @synch, @sameAs, @copyOf, @next, @prev, @exclude, @select)) (att.global.analytic (@ana)) (att.global.facs (@facs)) att.transcriptional (@hand, @status, @seq) (att.staged (@stage)) (att.editLike (@evidence, @source) (att.dimensions (@unit, @quantity, @extent, @precision, @scope) (att.ranging (@atLeast, @atMost, @min, @max)) ) (att.responsibility (@cert, @resp)) )
|
||||||||
| Used by | |||||||||
| May contain |
core:
abbr
add
address
bibl
cb
choice
cit
corr
date
del
desc
distinct
email
emph
expan
foreign
gap
gloss
graphic
hi
index
label
lb
list
listBibl
measure
measureGrp
mentioned
milestone
name
note
num
orig
p
pb
ptr
q
quote
ref
reg
rs
said
sic
soCalled
stage
term
time
title
unclear
gaiji:
g
header:
biblFull
msdescription:
catchwords
depth
dim
dimensions
height
heraldry
locus
locusGrp
material
msDesc
origDate
origPlace
secFol
signatures
stamp
watermark
width
namesdates:
affiliation
country
listEvent
listNym
persName
placeName
settlement
|
||||||||
| Declaration |
element
rewrite
{
att.spanning.attributes,
att.global.attributes,
att.transcriptional.attributes,
attribute cause { "fix" | "unclear" }?,
macro.paraContent
}
|
||||||||
| Note |
Multiple rewritings are indicated by nesting one
rewrite within another. In principle, a rewriting differs
from a substitution in that second and subsequent rewrites do not materially alter the content of an element. Where there are
minor changes made during the rewriting however these may be marked up
using del, add, etc. with an appropriate value for the stage
attribute.
|
||||||||
| <stageHist> contains one or more descriptions of the stages which have been identified in the genesis of a text. | |
| Module | derived-module-geneticTEI |
| In addition to global attributes | att.global (@xml:id, @n, @xml:lang, @rend, @rendition, @xml:base) (att.global.linking (@corresp, @synch, @sameAs, @copyOf, @next, @prev, @exclude, @select)) (att.global.analytic (@ana)) (att.global.facs (@facs)) |
| Used by | |
| May contain | |
| Declaration |
element
stageHist
{
att.global.attributes,
( model.pLike+ | ( summary?, change+ ) )
}
|
| <transpose> describes a single textual transposition as an ordered list of at least two pointers specifying the order in which the elements indicated should be re-combined. | |
| Module | derived-module-geneticTEI |
| In addition to global attributes | att.global (@xml:id, @n, @xml:lang, @rend, @rendition, @xml:base) (att.global.linking (@corresp, @synch, @sameAs, @copyOf, @next, @prev, @exclude, @select)) (att.global.analytic (@ana)) (att.global.facs (@facs)) |
| Used by | |
| May contain |
core:
ptr
|
| Declaration |
element transpose { att.global.attributes, ( ptr, ptr+ ) }
|
| <transposeGrp> supplies a list of transpositions indicated at some point in the text, typically by means of metamarks. | |
| Module | derived-module-geneticTEI |
| In addition to global attributes | att.global (@xml:id, @n, @xml:lang, @rend, @rendition, @xml:base) (att.global.linking (@corresp, @synch, @sameAs, @copyOf, @next, @prev, @exclude, @select)) (att.global.analytic (@ana)) (att.global.facs (@facs)) |
| Used by | |
| May contain |
derived-module-geneticTEI:
transpose
|
| Declaration |
element transposeGrp { att.global.attributes, transpose+ }
|
| <undo> Marks up an action represented by an element to be undone. | |||||||
| Module | derived-module-geneticTEI | ||||||
| In addition to global attributes |
att.spanning (@spanTo) att.staged (@stage) att.global (@xml:id, @n, @xml:lang, @rend, @rendition, @xml:base) (att.global.linking (@corresp, @synch, @sameAs, @copyOf, @next, @prev, @exclude, @select)) (att.global.analytic (@ana)) (att.global.facs (@facs))
|
||||||
| Used by | |||||||
| May contain | Empty element | ||||||
| Declaration |
element
undo
{
att.spanning.attributes,
att.staged.attributes,
att.global.attributes,
attribute target { xsd:anyURI }?,
empty
}
|
||||||
| <used> In many cases, authors mark portions of text as having been used, usually meaning the text has been transcribed to a fair copy. The mark is often a strikethrough, but can be any author-specific mark. | |
| Module | derived-module-geneticTEI |
| In addition to global attributes | att.spanning (@spanTo) att.staged (@stage) att.global (@xml:id, @n, @xml:lang, @rend, @rendition, @xml:base) (att.global.linking (@corresp, @synch, @sameAs, @copyOf, @next, @prev, @exclude, @select)) (att.global.analytic (@ana)) (att.global.facs (@facs)) |
| Used by | |
| May contain | Empty element |
| Declaration |
element
used
{
att.spanning.attributes,
att.staged.attributes,
att.global.attributes,
empty
}
|
| TEI: (TEI document) contains a single TEI-conformant document, comprising a TEI header and a text, either in isolation or as part of a teiCorpus element. |
| ab: (anonymous block) contains any arbitrary component-level unit of text, acting as an anonymous container for phrase or inter level elements analogous to, but without the semantic baggage of, a paragraph. |
| abbr: (abbreviation) contains an abbreviation of any sort. |
| accMat: (accompanying material) contains details of any significant additional material which may be closely associated with the manuscript being described, such as non-contemporaneous documents or fragments bound in with the manuscript at some earlier historical period. |
| acquisition: contains any descriptive or other information concerning the process by which a manuscript or manuscript part entered the holding institution. |
| actor: Name of an actor appearing within a cast list. |
| add: (addition) contains letters, words, or phrases inserted in the text by an author, scribe, annotator, or corrector. |
| addSpan: (added span of text) marks the beginning of a longer sequence of text added by an author, scribe, annotator or corrector (see also add). |
| additional: groups additional information, combining bibliographic information about a manuscript, or surrogate copies of it with curatorial or administrative information. |
| additions: contains a description of any significant additions found within a manuscript, such as marginalia or other annotations. |
| addrLine: (address line) contains one line of a postal address. |
| address: contains a postal address, for example of a publisher, an organization, or an individual. |
| adminInfo: (administrative information) contains information about the present custody and availability of the manuscript, and also about the record description itself. |
| affiliation: (affiliation) contains an informal description of a person's present or past affiliation with some organization, for example an employer or sponsor. |
| alt: (alternation) identifies an alternation or a set of choices among elements or passages. |
| altGrp: (alternation group) groups a collection of alt elements and possibly pointers. |
| altIdentifier: (alternative identifier) contains an alternative or former structured identifier used for a manuscript, such as a former catalogue number. |
| am: (abbreviation marker) contains a sequence of letters or signs present in an abbreviation which are omitted or replaced in the expanded form of the abbreviation. |
| anchor: (anchor point) attaches an identifier to a point within a text, whether or not it corresponds with a textual element. |
| app: (apparatus entry) contains one entry in a critical apparatus, with an optional lemma and at least one reading. |
| appInfo: (application information) records information about an application which has edited the TEI file. |
| application: provides information about an application which has acted upon the document. |
| arc: encodes an arc, the connection from one node to another in a graph. |
| argument: A formal list or prose description of the topics addressed by a subdivision of a text. |
| att.ascribed: provides attributes for elements representing speech or action that can be ascribed to a specific individual. |
| att.canonical: provides attributes which can be used to associate a representation such as a name or title with canonical information about the object being named or referenced. |
| att.coordinated: elements which can be positioned within a two dimensional coordinate system. |
| att.damaged: provides attributes describing the nature of any physical damage affecting a reading. |
| att.datable: provides attributes for normalization of elements that contain dates, times, or datable events. |
| att.datable.iso: provides attributes for normalization of elements that contain datable events using the ISO 8601 standard. |
| att.datable.w3c: provides attributes for normalization of elements that contain datable events using the W3C datatypes. |
| att.declarable: provides attributes for those elements in the TEI Header which may be independently selected by means of the special purpose decls attribute. |
| att.declaring: provides attributes for elements which may be independently associated with a particular declarable element within the header, thus overriding the inherited default for that element. |
| att.dimensions: provides attributes for describing the size of physical objects. |
| att.divLike: provides attributes common to all elements which behave in the same way as divisions. |
| att.editLike: provides attributes describing the nature of a encoded scholarly intervention or interpretation of any kind. |
| att.global: provides attributes common to all elements in the TEI encoding scheme. |
| att.global.analytic: provides additional global attributes for associating specific analyses or interpretations with appropriate portions of a text. |
| att.global.facs: groups elements corresponding with all or part of an image, because they contain an alternative representation of it, typically but not necessarily a transcription of it. |
| att.global.linking: defines a set of attributes for hypertext and other linking, which are enabled for all elements when the additional tag set for linking is selected. |
| att.handFeatures: provides attributes describing aspects of the hand in which a manuscript is written. |
| att.internetMedia: provides attributes for specifying the type of a computer resource using a standard taxonomy. |
| att.interpLike: provides attributes for elements which represent a formal analysis or interpretation. |
| att.measurement: provides attributes to represent a regularized or normalized measurement. |
| att.msExcerpt: (manuscript excerpt) provides attributes used to describe excerpts from a manuscript placed in a description thereof. |
| att.naming: provides attributes common to elements which refer to named persons, places, organizations etc. |
| att.personal: (attributes for components of personal names) common attributes for those elements which form part of a personal name. |
| att.placement: provides attributes for describing where on the source page or object a textual element appears. |
| att.pointing: defines a set of attributes used by all elements which point to other elements by means of one or more URI references. |
| att.pointing.group: defines a set of attributes common to all elements which enclose groups of pointer elements. |
| att.ranging: provides attributes for describing numerical ranges. |
| att.rdgPart: attributes for elements which mark the beginning or ending of a fragmentary manuscript or other witness. |
| att.responsibility: provides attributes indicating who is responsible for something asserted by the markup and the degree of certainty associated with it. |
| att.segLike: provides attributes for elements used for arbitrary segmentation. |
| att.sourced: provides attributes identifying the source edition from which some encoded feature derives. |
| att.spanning: provides attributes for elements which delimit a span of text by pointing mechanisms rather than by enclosing it. |
| att.tableDecoration: provides attributes used to decorate rows or cells of a table. |
| att.textCritical: defines a set of attributes common to all elements representing variant readings in text critical work. |
| att.transcriptional: provides attributes specific to elements encoding authorial or scribal intervention in a text when transcribing manuscript or similar sources. |
| att.translatable: provides attributes used to indicate the status of a translatable portion of an ODD document. |
| att.typed: provides attributes which can be used to classify or subclassify elements in any way. |
| author: in a bibliographic reference, contains the name(s) of the author(s), personal or corporate, of a work; for example in the same form as that provided by a recognized bibliographic name authority. |
| authority: (release authority) supplies the name of a person or other agency responsible for making an electronic file available, other than a publisher or distributor. |
| availability: supplies information about the availability of a text, for example any restrictions on its use or distribution, its copyright status, etc. |
| back: (back matter) contains any appendixes, etc. following the main part of a text. |
| bibl: (bibliographic citation) contains a loosely-structured bibliographic citation of which the sub-components may or may not be explicitly tagged. |
| biblFull: (fully-structured bibliographic citation) contains a fully-structured bibliographic citation, in which all components of the TEI file description are present. |
| biblScope: (scope of citation) defines the scope of a bibliographic reference, for example as a list of page numbers, or a named subdivision of a larger work. |
| binding: contains a description of one binding, i.e. type of covering, boards, etc. applied to a manuscript. |
| bindingDesc: (binding description) describes the present and former bindings of a manuscript, either as a series of paragraphs or as a series of distinct binding elements, one for each binding of the manuscript. |
| body: (text body) contains the whole body of a single unitary text, excluding any front or back matter. |
| byline: contains the primary statement of responsibility given for a work on its title page or at the head or end of the work. |
| cRefPattern: (canonical reference pattern) specifies an expression and replacement pattern for transforming a canonical reference into a URI. |
| camera: describes a particular camera angle or viewpoint in a screen play. |
| caption: contains the text of a caption or other text displayed as part of a film script or screenplay. |
| castGroup: (cast list grouping) groups one or more individual castItem elements within a cast list. |
| castItem: (cast list item) contains a single entry within a cast list, describing either a single role or a list of non-speaking roles. |
| castList: (cast list) contains a single cast list or dramatis personae. |
| catDesc: (category description) describes some category within a taxonomy or text typology, either in the form of a brief prose description or in terms of the situational parameters used by the TEI formal textDesc. |
| catRef: (category reference) specifies one or more defined categories within some taxonomy or text typology. |
| catchwords: describes the system used to ensure correct ordering of the quires making up a codex or incunable, typically by means of annotations at the foot of the page. |
| category: contains an individual descriptive category, possibly nested within a superordinate category, within a user-defined taxonomy. |
| cb: (column break) marks the boundary between one column of a text and the next in a standard reference system. |
| cell: contains one cell of a table. |
| certainty: indicates the degree of certainty or uncertainty associated with some aspect of the text markup. |
| change: documents a particular stage in the genesis of a text. |
| char: (character) provides descriptive information about a character. |
| charDecl: (character declarations) provides information about nonstandard characters and glyphs. |
| charName: (character name) contains the name of a character, expressed following Unicode conventions. |
| charProp: (character property) provides a name and value for some property of the parent character or glyph. |
| choice: groups a number of alternative encodings for the same point in a text. |
| cit: (cited quotation) contains a quotation from some other document, together with a bibliographic reference to its source. In a dictionary it may contain an example text with at least one occurrence of the word form, used in the sense being described, or a translation of the headword, or an example. |
| classCode: (classification code) contains the classification code used for this text in some standard classification system. |
| classDecl: (classification declarations) contains one or more taxonomies defining any classificatory codes used elsewhere in the text. |
| climate: (climate) contains information about the physical climate of a place. |
| closer: groups together salutations, datelines, and similar phrases appearing as a final group at the end of a division, especially of a letter. |
| collation: contains a description of how the leaves or bifolia are physically arranged. |
| collection: contains the name of a collection of manuscripts, not necessarily located within a single repository. |
| colophon: contains the colophon of a manuscript item: that is, a statement providing information regarding the date, place, agency, or reason for production of the manuscript. |
| condition: contains a description of the physical condition of the manuscript. |
| corr: (correction) contains the correct form of a passage apparently erroneous in the copy text. |
| correction: (correction principles) states how and under what circumstances corrections have been made in the text. |
| country: (country) contains the name of a geo-political unit, such as a nation, country, colony, or commonwealth, larger than or administratively superior to a region and smaller than a bloc. |
| creation: contains information about the creation of a text. |
| custEvent: (custodial event) describes a single event during the custodial history of a manuscript. |
| custodialHist: (custodial history) contains a description of a manuscript's custodial history, either as running prose or as a series of dated custodial events. |
| damage: contains an area of damage to the text witness. |
| damageSpan: (damaged span of text) marks the beginning of a longer sequence of text which is damaged in some way but still legible. |
| date: contains a date in any format. |
| dateline: contains a brief description of the place, date, time, etc. of production of a letter, newspaper story, or other work, prefixed or suffixed to it as a kind of heading or trailer. |
| decoDesc: (decoration description) contains a description of the decoration of a manuscript, either as a sequence of paragraphs, or as a sequence of topically organised decoNote elements. |
| decoNote: (note on decoration) contains a note describing either a decorative component of a manuscript, or a fairly homogenous class of such components. |
| del: (deletion) contains a letter, word, or passage deleted, marked as deleted, or otherwise indicated as superfluous or spurious in the copy text by an author, scribe, annotator, or corrector. |
| delSpan: (deleted span of text) marks the beginning of a longer sequence of text deleted, marked as deleted, or otherwise signaled as superfluous or spurious by an author, scribe, annotator, or corrector. |
| depth: contains a measurement measured across the spine of a book or codex, or (for other text-bearing objects) perpendicular to the measurement given by the ‘width’ element. |
| desc: (description) contains a brief description of the object documented by its parent element, including its intended usage, purpose, or application where this is appropriate. |
| dim: contains any single measurement forming part of a dimensional specification of some sort. |
| dimensions: contains a dimensional specification. |
| distinct: identifies any word or phrase which is regarded as linguistically distinct, for example as archaic, technical, dialectal, non-preferred, etc., or as forming part of a sublanguage. |
| distributor: supplies the name of a person or other agency responsible for the distribution of a text. |
| div: (text division) contains a subdivision of the front, body, or back of a text. |
| divGen: (automatically generated text division) indicates the location at which a textual division generated automatically by a text-processing application is to appear. |
| docAuthor: (document author) contains the name of the author of the document, as given on the title page (often but not always contained in a byline). |
| docDate: (document date) contains the date of a document, as given (usually) on a title page. |
| docEdition: (document edition) contains an edition statement as presented on a title page of a document. |
| docImprint: (document imprint) contains the imprint statement (place and date of publication, publisher name), as given (usually) at the foot of a title page. |
| docTitle: (document title) contains the title of a document, including all its constituents, as given on a title page. |
| eLeaf: (leaf or terminal node of an embedding tree) provides explicitly for a leaf of an embedding tree, which may also be encoded with the eTree element. |
| eTree: (embedding tree) provides an alternative to tree element for representing ordered rooted tree structures. |
| edition: (edition) describes the particularities of one edition of a text. |
| editionStmt: (edition statement) groups information relating to one edition of a text. |
| editor: secondary statement of responsibility for a bibliographic item, for example the name of an individual, institution or organization, (or of several such) acting as editor, compiler, translator, etc. |
| editorialDecl: (editorial practice declaration) provides details of editorial principles and practices applied during the encoding of a text. |
| email: (electronic mail address) contains an e-mail address identifying a location to which e-mail messages can be delivered. |
| emph: (emphasized) marks words or phrases which are stressed or emphasized for linguistic or rhetorical effect. |
| encodingDesc: (encoding description) documents the relationship between an electronic text and the source or sources from which it was derived. |
| epigraph: contains a quotation, anonymous or attributed, appearing at the start of a section or chapter, or on a title page. |
| epilogue: contains the epilogue to a drama, typically spoken by an actor out of character, possibly in association with a particular performance or venue. |
| ex: (editorial expansion) contains a sequence of letters added by an editor or transcriber when expanding an abbreviation. |
| expan: (expansion) contains the expansion of an abbreviation. |
| explicit: contains the explicit of a manuscript item, that is, the closing words of the text proper, exclusive of any rubric or colophon which might follow it. |
| extent: describes the approximate size of a text as stored on some carrier medium, whether digital or non-digital, specified in any convenient units. |
| facsimile: contains a representation of some written source in the form of a set of images rather than as transcribed or encoded text. |
| figDesc: (description of figure) contains a brief prose description of the appearance or content of a graphic figure, for use when documenting an image without displaying it. |
| figure: groups elements representing or containing graphic information such as an illustration or figure. |
| fileDesc: (file description) contains a full bibliographic description of an electronic file. |
| filiation: contains information concerning the manuscript's filiation, i.e. its relationship to other surviving manuscripts of the same text, its protographs, antigraphs and apographs. |
| finalRubric: contains the string of words that denotes the end of a text division, often with an assertion as to its author and title, usually set off from the text itself by red ink, by a different size or type of script, or by some other such visual device. |
| floatingText: contains a single text of any kind, whether unitary or composite, which interrupts the text containing it at any point and after which the surrounding text resumes. |
| foliation: describes the numbering system or systems used to count the leaves or pages in a codex. |
| foreign: (foreign) identifies a word or phrase as belonging to some language other than that of the surrounding text. |
| forest: provides for groups of rooted trees. |
| forestGrp: (forest group) provides for groups of forests. |
| formula: contains a mathematical or other formula. |
| front: (front matter) contains any prefatory matter (headers, title page, prefaces, dedications, etc.) found at the start of a document, before the main body. |
| funder: (funding body) specifies the name of an individual, institution, or organization responsible for the funding of a project or text. |
| fw: (forme work) contains a running head (e.g. a header, footer), catchword, or similar material appearing on the current page. |
| g: (character or glyph) represents a non-standard character or glyph. |
| gap: (gap) indicates a point where material has been omitted in a transcription, whether for editorial reasons described in the TEI header, as part of sampling practice, or because the material is illegible, invisible, or inaudible. |
| geoDecl: (geographic coordinates declaration) documents the notation and the datum used for geographic coordinates expressed as content of the <geo> element elsewhere within the document. |
| gloss: identifies a phrase or word used to provide a gloss or definition for some other word or phrase. |
| glyph: (character glyph) provides descriptive information about a character glyph. |
| glyphName: (character glyph name) contains the name of a glyph, expressed following Unicode conventions for character names. |
| graph: encodes a graph, which is a collection of nodes, and arcs which connect the nodes. |
| graphic: indicates the location of an inline graphic, illustration, or figure. |
| group: contains the body of a composite text, grouping together a sequence of distinct texts (or groups of such texts) which are regarded as a unit for some purpose, for example the collected works of an author, a sequence of prose essays, etc. |
| handDesc: (description of hands) contains a description of all the different kinds of writing used in a manuscript. |
| handNote: (note on hand) describes a particular style or hand distinguished within a manuscript. |
| handNotes: contains one or more handNote elements documenting the different hands identified within the source texts. |
| handShift: marks the beginning of a sequence of text written in a new hand, or the beginning of a scribal stint. |
| head: (heading) contains any type of heading, for example the title of a section, or the heading of a list, glossary, manuscript description, etc. |
| height: contains a measurement measured along the axis at right angles to the bottom of the written surface, i.e. parallel to the spine for a codex or book. |
| heraldry: contains a heraldic formula or phrase, typically found as part of a blazon, coat of arms, etc. |
| hi: (highlighted) marks a word or phrase as graphically distinct from the surrounding text, for reasons concerning which no claim is made. |
| history: groups elements describing the full history of a manuscript or manuscript part. |
| hyphenation: summarizes the way in which hyphenation in a source text has been treated in an encoded version of it. |
| iNode: (intermediate (or internal) node) represents an intermediate (or internal) node of a tree. |
| idno: (identifying number) supplies any number or other identifier used to identify a bibliographic item in a standardized way. |
| imprimatur: contains a formal statement authorizing the publication of a work, sometimes required to appear on a title page or its verso. |
| incipit: contains the incipit of a manuscript item, that is the opening words of the text proper, exclusive of any rubric which might precede it, of sufficient length to identify the work uniquely; such incipts were, in fomer times, frequently used a means of reference to a work, in place of a title. |
| index: (index entry) marks a location to be indexed for whatever purpose. |
| institution: contains the name of an organization such as a university or library, with which a manuscript is identified, generally its holding institution. |
| interp: (interpretation) summarizes a specific interpretative annotation which can be linked to a span of text. |
| interpGrp: (interpretation group) collects together a set of related interpretations which share responsibility or type. |
| interpretation: describes the scope of any analytic or interpretive information added to the text in addition to the transcription. |
| item: contains one component of a list. |
| join: identifies a possibly fragmented segment of text, by pointing at the possibly discontiguous elements which compose it. |
| joinGrp: (join group) groups a collection of join elements and possibly pointers. |
| keywords: contains a list of keywords or phrases identifying the topic or nature of a text. |
| l: (verse line) contains a single, possibly incomplete, line of verse. |
| label: contains the label associated with an item in a list; in glossaries, marks the term being defined. |
| lacunaEnd: indicates the end of a lacuna in a mostly complete textual witness. |
| lacunaStart: indicates the beginning of a lacuna in the text of a mostly complete textual witness. |
| langUsage: (language usage) describes the languages, sublanguages, registers, dialects, etc. represented within a text. |
| language: characterizes a single language or sublanguage used within a text. |
| layout: describes how text is laid out on the page, including information about any ruling, pricking, or other evidence of page-preparation techniques. |
| layoutDesc: (layout description) collects the set of layout descriptions applicable to a manuscript. |
| lb: (line break) marks the start of a new (typographic) line in some edition or version of a text. |
| leaf: encodes the leaves (terminal nodes) of a tree. |
| lem: (lemma) contains the lemma, or base text, of a textual variation. |
| lg: (line group) contains a group of verse lines functioning as a formal unit, e.g. a stanza, refrain, verse paragraph, etc. |
| link: defines an association or hypertextual link among elements or passages, of some type not more precisely specifiable by other elements. |
| linkGrp: (link group) defines a collection of associations or hypertextual links. |
| list: (list) contains any sequence of items organized as a list. |
| listBibl: (citation list) contains a list of bibliographic citations of any kind. |
| listEvent: (list of events) contains a list of descriptions, each of which provides information about an identifiable event. |
| listNym: (list of canonical names) contains a list of nyms, that is, standardized names for any thing. |
| listWit: (witness list) lists definitions for all the witnesses referred to by a critical apparatus, optionally grouped hierarchically. |
| localName: (locally-defined property name) contains a locally defined name for some property. |
| locus: defines a location within a manuscript or manuscript part, usually as a (possibly discontinuous) sequence of folio references. |
| locusGrp: groups a number of locations which together form a distinct but discontinuous item within a manuscript or manuscript part, according to a specific foliation. |
| m: (morpheme) represents a grammatical morpheme. |
| macro.limitedContent: (paragraph content) defines the content of prose elements that are not used for transcription of extant materials. |
| macro.paraContent: (paragraph content) defines the content of paragraphs and similar elements. |
| macro.phraseSeq: (phrase sequence) defines a sequence of character data and phrase-level elements. |
| macro.phraseSeq.limited: (limited phrase sequence) defines a sequence of character data and those phrase-level elements that are not typically used for transcribing extant documents. |
| macro.specialPara: ('special' paragraph content) defines the content model of elements such as notes or list items, which either contain a series of component-level elements or else have the same structure as a paragraph, containing a series of phrase-level and inter-level elements. |
| macro.xtext: (extended text) defines a sequence of character data and gaiji elements. |
| mapping: (character mapping) contains one or more characters which are related to the parent character or glyph in some respect, as specified by the type attribute. |
| material: contains a word or phrase describing the material of which a manuscript (or part of a manuscript) is composed. |
| measure: contains a word or phrase referring to some quantity of an object or commodity, usually comprising a number, a unit, and a commodity name. |
| measureGrp: (measure group) contains a group of dimensional specifications which relate to the same object, for example the height and width of a manuscript page. |
| mentioned: marks words or phrases mentioned, not used. |
| milestone: marks a boundary point separating any kind of section of a text, typically but not necessarily indicating a point at which some part of a standard reference system changes, where the change is not represented by a structural element. |
| model.addrPart: groups elements such as names or postal codes which may appear as part of a postal address. |
| model.addressLike: groups elements used to represent a postal or e-mail address. |
| model.applicationLike: groups elements used to record application-specific information about a document in its header. |
| model.biblLike: groups elements containing a bibliographic description. |
| model.biblPart: groups elements which represent components of a bibliographic description. |
| model.castItemPart: groups component elements of an entry in a cast list, such as dramatic role or actor's name. |
| model.catDescPart: groups component elements of the TEI Header Category Description. |
| model.choicePart: groups elements (other than choice itself) which can be used within a choice alternation. |
| model.common: groups common chunk- and inter-level elements. |
| model.dateLike: groups elements containing temporal expressions. |
| model.dimLike: groups elements which describe a measurement forming part of the physical dimensions of some object. |
| model.div1Like: groups top-level structural divisions. |
| model.divBottom: groups elements appearing at the end of a text division. |
| model.divBottomPart: groups elements which can occur only at the end of a text division. |
| model.divGenLike: groups elements used to represent a structural division which is generated rather than explicitly present in the source. |
| model.divLike: groups elements used to represent un-numbered generic structural divisions. |
| model.divPart: groups paragraph-level elements appearing directly within divisions. |
| model.divTop: groups elements appearing at the beginning of a text division. |
| model.divTopPart: groups elements which can occur only at the beginning of a text division. |
| model.divWrapper: groups elements which can appear at either top or bottom of a textual division. |
| model.editorialDeclPart: groups elements which may be used inside editorialDecl and appear multiple times. |
| model.egLike: groups elements containing examples or illustrations. |
| model.emphLike: groups phrase-level elements which are typographically distinct and to which a specific function can be attributed. |
| model.encodingDescPart: groups elements which may be used inside encodingDesc and appear multiple times. |
| model.entryPart: groups elements appearing at any level within a dictionary entry. |
| model.entryPart.top: groups high level elements within a structured dictionary entry |
| model.frontPart: groups elements which appear at the level of divisions within front or back matter. |
| model.frontPart.drama: groups elements which appear at the level of divisions within front or back matter of performance texts only. |
| model.gLike: groups elements used to represent individual non-Unicode characters or glyphs. |
| model.global: groups elements which may appear at any point within a TEI text. |
| model.global.edit: groups globally available elements which perform a specifically editorial function. |
| model.global.meta: groups globally available elements which describe the status of other elements. |
| model.glossLike: groups elements which provide an alternative name, explanation, or description for any markup construct. |
| model.graphicLike: groups elements containing images, formulae, and similar objects. |
| model.headLike: groups elements used to provide a title or heading at the start of a text division. |
| model.hiLike: groups phrase-level elements which are typographically distinct but to which no specific function can be attributed. |
| model.highlighted: groups phrase-level elements which are typographically distinct. |
| model.imprintPart: groups the bibliographic elements which occur inside imprints. |
| model.inter: groups elements which can appear either within or between paragraph-like elements. |
| model.lLike: groups elements representing metrical components such as verse lines. |
| model.labelLike: groups elements used to gloss or explain other parts of a document. |
| model.limitedPhrase: groups phrase-level elements excluding those elements primarily intended for transcription of existing sources. |
| model.listLike: groups list-like elements. |
| model.measureLike: groups elements which denote a number, a quantity, a measurement, or similar piece of text that conveys some numerical meaning. |
| model.milestoneLike: groups milestone-style elements used to represent reference systems. |
| model.msItemPart: groups elements which can appear within a manuscript item description. |
| model.msQuoteLike: groups elements which represent passages such as titles quoted from a manuscript as a part of its description. |
| model.nameLike: groups elements which name or refer to a person, place, or organization. |
| model.nameLike.agent: groups elements which contain names of individuals or corporate bodies. |
| model.noteLike: groups globally-available note-like elements. |
| model.pLike: groups paragraph-like elements. |
| model.pLike.front: groups paragraph-like elements which can occur as direct constituents of front matter. |
| model.pPart.data: groups phrase-level elements containing names, dates, numbers, measures, and similar data. |
| model.pPart.edit: groups phrase-level elements for simple editorial correction and transcription. |
| model.pPart.editorial: groups phrase-level elements for simple editorial interventions that may be useful both in transcribing and in authoring. |
| model.pPart.msdesc: groups phrase-level elements used in manuscript description. |
| model.pPart.transcriptional: groups phrase-level elements used for editorial transcription of pre-existing source materials. |
| model.persStateLike: groups elements describing changeable characteristics of a person which have a definite duration, for example occupation, residence, or name. |
| model.personPart: groups elements which form part of the description of a person. |
| model.phrase: groups elements which can occur at the level of individual words or phrases. |
| model.physDescPart: groups specialised elements forming part of the physical description of a manuscript or similar written source. |
| model.placeNamePart: groups elements which form part of a place name. |
| model.placeStateLike: groups elements which describe changing states of a place. |
| model.placeTraitLike: groups elements which describe unchanging traits of a place. |
| model.profileDescPart: groups elements which may be used inside profileDesc and appear multiple times. |
| model.ptrLike: groups elements used for purposes of location and reference. |
| model.publicationStmtPart: groups elements which may appear within the publicationStmt element of the TEI Header. |
| model.qLike: groups elements related to highlighting which can appear either within or between chunk-level elements. |
| model.quoteLike: groups elements used to directly contain quotations. |
| model.rdgLike: groups elements which contain a single reading, other than the lemma, within a textual variation. |
| model.rdgPart: groups elements which mark the beginning or ending of a fragmentary manuscript or other witness. |
| model.resourceLike: groups non-textual elements which may appear together with a header and a text to constitute a TEI document. |
| model.respLike: groups elements which are used to indicate intellectual or other significant responsibility, for example within a bibliographic element. |
| model.segLike: groups elements used for arbitrary segmentation. |
| model.sourceDescPart: groups elements which may be used inside sourceDesc and appear multiple times. |
| model.stageLike: groups elements containing stage directions or similar things defined by the module for performance texts. |
| model.teiHeaderPart: groups high level elements which may appear more than once in a TEI Header. |
| model.titlepagePart: groups elements which can occur as direct constituents of a title page, such as docTitle, docAuthor, docImprint, or epigraph. |
| move: (movement) marks the actual entrance or exit of one or more characters on stage. |
| msContents: (manuscript contents) describes the intellectual content of a manuscript or manuscript part, either as a series of paragraphs or as a series of structured manuscript items. |
| msDesc: (manuscript description) contains a description of a single identifiable manuscript or other text-bearing object. |
| msIdentifier: (manuscript identifier) contains the information required to identify the manuscript being described. |
| msItem: (manuscript item) describes an individual work or item within the intellectual content of a manuscript or manuscript part. |
| msItemStruct: (structured manuscript item) contains a structured description for an individual work or item within the intellectual content of a manuscript or manuscript part. |
| msName: (alternative name) contains any form of unstructured alternative name used for a manuscript, such as an ‘ocellus nominum’, or nickname. |
| msPart: (manuscript part) contains information about an originally distinct manuscript or part of a manuscript, now forming part of a composite manuscript. |
| musicNotation: contains description of type of musical notation. |
| name: (name, proper noun) contains a proper noun or noun phrase. |
| node: encodes a node, a possibly labeled point in a graph. |
| normalization: indicates the extent of normalization or regularization of the original source carried out in converting it to electronic form. |
| note: contains a note or annotation. |
| notesStmt: (notes statement) collects together any notes providing information about a text additional to that recorded in other parts of the bibliographic description. |
| num: (number) contains a number, written in any form. |
| objectDesc: contains a description of the physical components making up the object which is being described. |
| opener: groups together dateline, byline, salutation, and similar phrases appearing as a preliminary group at the start of a division, especially of a letter. |
| orig: (original form) contains a reading which is marked as following the original, rather than being normalized or corrected. |
| origDate: (origin date) contains any form of date, used to identify the date of origin for a manuscript or manuscript part. |
| origPlace: (origin place) contains any form of place name, used to identify the place of origin for a manuscript or manuscript part. |
| origin: contains any descriptive or other information concerning the origin of a manuscript or manuscript part. |
| p: (paragraph) marks paragraphs in prose. |
| pb: (page break) marks the boundary between one page of a text and the next in a standard reference system. |
| pc: (punctuation character) a character or string of characters regarded as constituting a single punctuation mark. |
| performance: contains a section of front or back matter describing how a dramatic piece is to be performed in general or how it was performed on some specific occasion. |
| persName: (personal name) contains a proper noun or proper-noun phrase referring to a person, possibly including any or all of the person's forenames, surnames, honorifics, added names, etc. |
| phr: (phrase) represents a grammatical phrase. |
| physDesc: (physical description) contains a full physical description of a manuscript or manuscript part, optionally subdivided using more specialised elements from the model.physDescPart class. |
| placeName: contains an absolute or relative place name. |
| postscript: contains a postscript, e.g. to a letter. |
| precision: indicates the numerical accuracy or precision associated with some aspect of the text markup. |
| principal: (principal researcher) supplies the name of the principal researcher responsible for the creation of an electronic text. |
| profileDesc: (text-profile description) provides a detailed description of non-bibliographic aspects of a text, specifically the languages and sublanguages used, the situation in which it was produced, the participants and their setting. |
| projectDesc: (project description) describes in detail the aim or purpose for which an electronic file was encoded, together with any other relevant information concerning the process by which it was assembled or collected. |
| prologue: contains the prologue to a drama, typically spoken by an actor out of character, possibly in association with a particular performance or venue. |
| provenance: contains any descriptive or other information concerning a single identifiable episode during the history of a manuscript or manuscript part, after its creation but before its acquisition. |
| ptr: (pointer) defines a pointer to another location. |
| pubPlace: (publication place) contains the name of the place where a bibliographic item was published. |
| publicationStmt: (publication statement) groups information concerning the publication or distribution of an electronic or other text. |
| publisher: provides the name of the organization responsible for the publication or distribution of a bibliographic item. |
| q: (separated from the surrounding text with quotation marks) contains material which is marked as (ostensibly) being somehow different than the surrounding text, for any one of a variety of reasons including, but not limited to: direct speech or thought, technical terms or jargon, authorial distance, quotations from elsewhere, and passages that are mentioned but not used. |
| quotation: specifies editorial practice adopted with respect to quotation marks in the original. |
| quote: (quotation) contains a phrase or passage attributed by the narrator or author to some agency external to the text. |
| rdg: (reading) contains a single reading within a textual variation. |
| rdgGrp: (reading group) within a textual variation, groups two or more readings perceived to have a genetic relationship or other affinity. |
| recordHist: (recorded history) provides information about the source and revision status of the parent manuscript description itself. |
| ref: (reference) defines a reference to another location, possibly modified by additional text or comment. |
| refState: (reference state) specifies one component of a canonical reference defined by the milestone method. |
| refsDecl: (references declaration) specifies how canonical references are constructed for this text. |
| reg: (regularization) contains a reading which has been regularized or normalized in some sense. |
| relatedItem: contains or references some other bibliographic item which is related to the present one in some specified manner, for example as a constituent or alternative version of it. |
| relation: (relationship) describes any kind of relationship or linkage amongst a specified group of participants. |
| relationGrp: (relation group) provides information about relationships identified amongst people, places, and organizations, either informally as prose or as formally expressed relation links. |
| repository: contains the name of a repository within which manuscripts are stored, possibly forming part of an institution. |
| resp: (responsibility) contains a phrase describing the nature of a person's intellectual responsibility. |
| respStmt: (statement of responsibility) supplies a statement of responsibility for the intellectual content of a text, edition, recording, or series, where the specialized elements for authors, editors, etc. do not suffice or do not apply. |
| respons: (responsibility) identifies the individual(s) responsible for some aspect of the markup of particular element(s). |
| restore: indicates restoration of text to an earlier state by cancellation of an editorial or authorial marking or instruction. |
| revisionDesc: (revision description) summarizes the revision history for a file. |
| role: the name of a dramatic role, as given in a cast list. |
| roleDesc: (role description) describes a character's role in a drama. |
| root: (root node) represents the root node of a tree. |
| row: contains one row of a table. |
| rs: (referencing string) contains a general purpose name or referring string. |
| rubric: contains the text of any rubric or heading attached to a particular manuscript item, that is, a string of words through which a manuscript signals the beginning of a text division, often with an assertion as to its author and title, which is in some way set off from the text itself, usually in red ink, or by use of different size or type of script, or some other such visual device. |
| s: (s-unit) contains a sentence-like division of a text. |
| said: (speech or thought) indicates passages thought or spoken aloud, whether explicitly indicated in the source or not, whether directly or indirectly reported, whether by real people or fictional characters. |
| salute: (salutation) contains a salutation or greeting prefixed to a foreword, dedicatory epistle, or other division of a text, or the salutation in the closing of a letter, preface, etc. |
| samplingDecl: (sampling declaration) contains a prose description of the rationale and methods used in sampling texts in the creation of a corpus or collection. |
| seal: contains a description of one seal or similar attachment applied to a manuscript. |
| sealDesc: (seal description) describes the seals or other external items attached to a manuscript, either as a series of paragraphs or as a series of distinct seal elements, possibly with additional decoNotes. |
| secFol: (second folio) The word or words taken from a fixed point in a codex (typically the beginning of the second leaf) in order to provide a unique identifier for it. |
| seg: (arbitrary segment) represents any segmentation of text below the ‘chunk’ level. |
| segmentation: describes the principles according to which the text has been segmented, for example into sentences, tone-units, graphemic strata, etc. |
| series: (series information) contains information about the series in which a book or other bibliographic item has appeared. |
| seriesStmt: (series statement) groups information about the series, if any, to which a publication belongs. |
| set: (setting) contains a description of the setting, time, locale, appearance, etc., of the action of a play, typically found in the front matter of a printed performance text (not a stage direction). |
| settlement: contains the name of a settlement such as a city, town, or village identified as a single geo-political or administrative unit. |
| sic: (latin for thus or so ) contains text reproduced although apparently incorrect or inaccurate. |
| signatures: contains discussion of the leaf or quire signatures found within a codex. |
| signed: (signature) contains the closing salutation, etc., appended to a foreword, dedicatory epistle, or other division of a text. |
| soCalled: contains a word or phrase for which the author or narrator indicates a disclaiming of responsibility, for example by the use of scare quotes or italics. |
| sound: describes a sound effect or musical sequence specified within a screen play or radio script. |
| source: describes the original source for the information contained with a manuscript description. |
| sourceDesc: (source description) describes the source from which an electronic text was derived or generated, typically a bibliographic description in the case of a digitized text, or a phrase such as "born digital" for a text which has no previous existence. |
| sp: (speech) An individual speech in a performance text, or a passage presented as such in a prose or verse text. |
| space: indicates the location of a significant space in the copy text. |
| span: associates an interpretative annotation directly with a span of text. |
| spanGrp: (span group) collects together span tags. |
| speaker: A specialized form of heading or label, giving the name of one or more speakers in a dramatic text or fragment. |
| sponsor: specifies the name of a sponsoring organization or institution. |
| stage: (stage direction) contains any kind of stage direction within a dramatic text or fragment. |
| stamp: contains a word or phrase describing a stamp or similar device. |
| subst: (substitution) groups one or more deletions with one or more additions when the combination is to be regarded as a single intervention in the text. |
| summary: contains a brief summary of the intellectual content of an item, provided by the cataloguer. |
| supplied: signifies text supplied by the transcriber or editor for any reason, typically because the original cannot be read because of physical damage or loss to the original. |
| support: contains a description of the materials etc. which make up the physical support for the written part of a manuscript. |
| supportDesc: (support description) groups elements describing the physical support for the written part of a manuscript. |
| surface: defines a written surface in terms of a rectangular coordinate space, optionally grouping one or more graphic representations of that space, and rectangular zones of interest within it. |
| surplus: marks text present in the source which the editor believes to be superfluous or redundant. |
| surrogates: contains information about any non-digital representations of the manuscript being described which may exist in the holding institution or elsewhere. |
| taxonomy: defines a typology used to classify texts either implicitly, by means of a bibliographic citation, or explicitly by a structured taxonomy. |
| tech: (technical stage direction) describes a special-purpose stage direction that is not meant for the actors. |
| teiCorpus: contains the whole of a TEI encoded corpus, comprising a single corpus header and one or more TEI elements, each containing a single text header and a text. |
| teiHeader: (TEI Header) supplies the descriptive and declarative information making up an electronic title page prefixed to every TEI-conformant text. |
| term: contains a single-word, multi-word, or symbolic designation which is regarded as a technical term. |
| text: contains a single text of any kind, whether unitary or composite, for example a poem or drama, a collection of essays, a novel, a dictionary, or a corpus sample. |
| textClass: (text classification) groups information which describes the nature or topic of a text in terms of a standard classification scheme, thesaurus, etc. |
| textLang: (text language) in a manuscript description, describes the languages and writing systems identified within the manuscript being described. |
| time: contains a phrase defining a time of day in any format. |
| timeline: (timeline) provides a set of ordered points in time which can be linked to elements of a spoken text to create a temporal alignment of that text. |
| title: contains a title for any kind of work. |
| titlePage: (title page) contains the title page of a text, appearing within the front or back matter. |
| titlePart: contains a subsection or division of the title of a work, as indicated on a title page. |
| titleStmt: (title statement) groups information about the title of a work and those responsible for its intellectual content. |
| trailer: contains a closing title or footer appearing at the end of a division of a text. |
| tree: encodes a tree, which is made up of a root, internal nodes, leaves, and arcs from root to leaves. |
| triangle: (underspecified embedding tree, so called because of its characteristic shape when drawn) Provides for an underspecified eTree, that is, an eTree with information left out. |
| typeDesc: contains a description of the typefaces or other aspects of the printing of an incunable or other printed source. |
| typeNote: describes a particular font or other significant typographic feature distinguished within the description of a printed resource. |
| unclear: contains a word, phrase, or passage which cannot be transcribed with certainty because it is illegible or inaudible in the source. |
| unicodeName: (unicode property name) contains the name of a registered Unicode normative or informative property. |
| value: (value) contains a single value for some property, attribute, or other analysis. |
| variantEncoding: declares the method used to encode text-critical variants. |
| view: describes the visual context of some part of a screen play in terms of what the spectator sees, generally independent of any dialogue. |
| watermark: contains a word or phrase describing a watermark or similar device. |
| when: indicates a point in time either relative to other elements in the same timeline tag, or absolutely. |
| width: contains a measurement measured along the axis parallel to the bottom of the written surface, i.e. perpendicular to the spine of a book or codex. |
| wit: contains a list of one or more sigla of witnesses attesting a given reading, in a textual variation. |
| witDetail: (witness detail) gives further information about a particular witness, or witnesses, to a particular reading. |
| witEnd: (fragmented witness end) indicates the end, or suspension, of the text of a fragmentary witness. |
| witStart: (fragmented witness start) indicates the beginning, or resumption, of the text of a fragmentary witness. |
| witness: contains either a description of a single witness referred to within the critical apparatus, or a list of witnesses which is to be referred to by a single sigil. |
| zone: defines a rectangular area contained within a surface element. |