21 Certainty and Responsibility
Inhalt
- the note element defined in section 3.8 Notes, Annotation, and Indexing may be used with a value of certainty for its type attribute.
- the certainty element defined in this chapter may be used to record the nature and degree of the uncertainty in a more structured way.
- the alt element defined in the module for linking and segmentation may be used to provide alternative encodings for parts of a text, as described in section 16.8 Alternation.
- the TEI header records who is responsible for an electronic text by means of the respStmt element and other more specific elements (author, sponsor, funder, principal, etc.) used within the titleStmt, editionStmt, and revisionDesc elements.
- the note element may be used with a value of resp or responsibility in its type attribute.
- the respons element defined in this chapter may be used to record fine-grained structured information about responsibility for individual tags in the text.
21.1 Levels of CertaintyTEI: Levels of Certainty¶
- a given tag may or may not correctly apply (e.g. a given word may be a personal name, or perhaps not)
- the precise point at which an element begins or ends is uncertain
- the value to be given for an attribute is uncertain
- content supplied by the encoder (such as the expansion of an abbreviation marked by the abbr tag) is uncertain
- the transcription of a source text is uncertain, perhaps because it is hard to read or hard to hear; this sort of uncertainty is also handled by the unclear element in section 11.5.1 Damage, Illegibility, and Supplied Text
- a number or date is imprecise
- the text is ambiguous, so a given passage has several possible interpretations
- a transcriber, editor, or author wishes to indicate a level of confidence in a factual assertion made in the text
- an author is not sure if the sentence she has chosen to start a paragraph is really the one she wants to retain in the final version
21.1.1 Using Notes to Record UncertaintyTEI: Using Notes to Record Uncertainty¶
Using note, the uncertainty here may be recorded quite simply:Elizabeth went to Essex. She had always liked Essex.
<note type="uncertainty" resp="#MSM">It is not
clear here whether <mentioned>Essex</mentioned>
refers to the place or to the nobleman. -MSM</note>
She had always liked <placeName xml:id="CE-p1b">Essex</placeName>.
<note type="uncertainty" resp="#MSM" target="#CE-p1a #CE-p1b">It
is not clear here whether <mentioned>Essex</mentioned>
refers to the place or to the nobleman. If the latter,
it should be tagged as a personal name. -<name xml:id="MSM">Michael</name>
</note>
The advantage of this technique is its relative simplicity. Its disadvantage is that the nature and degree of uncertainty are not conveyed in any systematic way and thus are not susceptible to any sort of automatic processing.
21.1.2 Structured Indications of UncertaintyTEI: Structured Indications of Uncertainty¶
- certainty indicates the degree of certainty or uncertainty associated with some aspect of the text markup.
<placeName xml:id="CE-pl1">Essex</placeName>.
<!-- ... elsewhere in the document ... -->
<certainty target="#CE-pl1" locus="gi">
<desc>possibly not a placename</desc>
</certainty>
<!-- ... --><certainty target="#CE-pl1" locus="gi" degree="0.6"/>
<!-- ... --><certainty target="#CE-pl1" locus="gi" degree="0.6">
<desc>probably a placename, but possibly not</desc>
</certainty>
<certainty
target="#CE-pl1"
locus="gi"
degree="0.4"
assertedValue="persName">
<desc>may refer to the Earl of Essex</desc>
</certainty>
She had always liked <placeName xml:id="CE-PL2">Essex</placeName>.
<!-- ... -->
<!-- 60% chance that P1 is a placename, 40% chance a personal name. -->
<certainty
xml:id="cert-1"
target="#CE-PL1"
locus="gi"
degree="0.6">
<desc>probably a placename, but possibly not"</desc>
</certainty>
<certainty
xml:id="cert-2"
target="#CE-PL1"
locus="gi"
assertedValue="persName"
degree="0.4">
<desc>may refer to the Earl of Essex"</desc>
</certainty>
<!-- 60% chance that P2 is a placename, 40% chance a personal name. 100% chance that it agrees with P1. -->
<certainty
target="#CE-PL2"
locus="gi"
given="#cert-1"
degree="1.0">
<desc>if P1 is a placename, P2 certainly is"</desc>
</certainty>
<certainty
target="#CE-PL2"
locus="gi"
assertedValue="persName"
degree="1.0"
given="#cert-2">
<desc>if p1 refers to the Earl of Essex, so does P2</desc>
</certainty>
<certainty
xml:id="cert1"
target="#CE-p2"
locus="gi"
degree="0.6"/>
<certainty
target="#CE-p2"
locus="startLoc"
given="#cert1"
degree="0.9"/>
<certainty
xml:id="cert2"
target="#CE-p2"
locus="gi"
assertedValue="placeName"
degree="0.4"/>
<certainty
target="#CE-p2"
locus="startLoc"
given="#cert2"
degree="0.5"/>
<certainty
xml:id="cert3"
target="#CE-p2"
locus="startLoc"
assertedValue="CE-a1"
given="#cert1"
degree="0.1"/>
<certainty
xml:id="cert4"
target="#CE-p2"
locus="startLoc"
assertedValue="CE-a1"
given="#cert2"
degree="0.5"/>
Ernest went to old <placeName>Saybrook</placeName>. (0.4 * 0.5, or 0.20)
Ernest went to <placeName>old Saybrook</placeName>. (0.4 * 0.5, or 0.20)
<certainty target="#CE-u1" locus="att.who" degree="0.5"/>
<certainty target="#CE-p3" locus="transcribedContent" degree="0.5"/>
<choice>
<expan xml:id="CE-e1">Standard
Generalized Markup Language</expan>
<expan xml:id="CE-e4">Some Grandiose Methodology for Losers</expan>
<abbr>SGML</abbr>
</choice> ...
<!-- ... -->
<certainty target="#CE-e1" locus="suppliedContent" degree="0.9"/>
<certainty
target="#CE-P3"
locus="transcribedContent"
assertedValue="gun"
degree="0.8">
<desc>a gun makes more sense in a holdup</desc>
</certainty>
The certainty element and the other TEI mechanisms for indicating uncertainty provide a range of methods of graduated complexity. Simple expressions of uncertainty may be made by using the note element. This is simple and convenient, and can accommodate either a discursive and unstructured indication of uncertainty, or a complex and structured but probably project-specific expression of uncertainty. In general, however, unless special steps are taken, the note element does not provide as much expressive power as the certainty element, and in cases where highly structured certainty information must be given, it is recommended that the certainty element be used.
The certainty element may be used for simple unqualified indications of uncertainty, in which case only the locus and target attributes might be specified. In more complex cases, the other attributes may be used to provide fuller information. While these attributes may take any string of characters as value, the recommended values should be used wherever possible; if they are not appropriate in a given situation, encoders should provide their own controlled vocabulary and document it in the encodingDesc or tagUsage elements of the TEI header.
21.2 Attribution of ResponsibilityTEI: Attribution of Responsibility¶
In general, attribution of responsibility for the transcription and markup of an electronic text is made by respStmt elements within the header: specifically, within the title statement, the edition statement(s), and the revision history.
- respons (responsibility) identifies the individual(s) responsible for some aspect of the markup of particular element(s).
<!-- ... -->
<respons target="#CE-p5" locus="transcribedContent" resp="#RC"/>
<respons target="#CE-p5" locus="gi location" resp="#PMWR"/>
<list type="encoders">
<item xml:id="PMWR"/>
<item xml:id="RC"/>
</list>
Some elements bear specialized resp or agent attributes, which have specific meanings that vary from element to element; the respons element should be reserved for the general aspects of responsibility common to all text transcription and markup, and should not be confused with the more specific attributes on individual elements.
21.3 The Certainty ModuleTEI: The Certainty Module¶
↑ Inhalt « 20 Non-hierarchical Structures » 22 Documentation Elements