Renear, H. Allen. “Text Encoding”

From CIRCA

Jump to: navigation, search
VTracker
Content inserted. (5 Occurances)
Content structure inserted. (1 Occurances)
Content changed. (4 Occurances)
Content style of a font changed. (1 Occurances)

Renear, H. Allen. ???Text Encoding.??? A Companion to Digital Humanities';this.style.color = '#ff0000';" onMouseOut = "this.innerHTML = 'A Companion to Digital Humanities';this.style.color = '#000000';">A Companion to Digital Humanities, ed. Susan Schrelbman, Ray Siemens, John Unsworth.Oxford: Blackwell, 2004. Reviewed By Joseph Dung

Renear in his article ???Text Encoding??? (2004) providesvaluable insights about the historical and theoretical contextneeded to understand ???both contemporary text encoding practicesand the various ongoing debates that surround those practices.???He expounds on the theoretical frameworks that guided thedevelopment of markup related techniques and systems and theongoing debates that surrounded them. He pointed out thattraditional humanities computing was concerned mostly withliterature and language analysis but text encoding encompasses awider sense to include new cultural products like ???new media???.In presenting a brief history of markup languages he proceeds todelineate the advantages of the dominant type of markup model thathas been developed: the Descriptive markup.

However, even as Renear discusses the Descriptive model, he discusses how that model naturally fits with our view of text as an ???Ordered Hierarchy of Content Objects??? (OHCO); an intuitive view of text structure he challenges in his other article, ???Refining our Notion of What Text Really Is: The Problem of Overlapping Hierarchies"-- with various axiomatic arguments. Since markup languages grew as a modern system of annotating text for additional processing by machines, they were initially procedural since formatting commands and instructions were embedded inside the text. Descriptive Markup coincided with the birth of SGML and had the advantage of labeling parts of the document in a way authors could understand. Underlying this Descriptive Markup is the idea that markup should be focused on the structural aspects of the document and leave the visual presentation of that structure to the interpreter. The way Descriptive models work (identifying words, lines, passages, paragraphs, headings) naturally fits the way humans understand text. Descriptive Markup labels also function like assembler mnemonics (or macros), where abbreviations could be used to represent and decode longer strings, enabling the possibility of creating global variables that can affect some aspects of the document, without entirely affecting the whole document. Renear compared and contrasted the descriptive and procedural models giving credence and weight to the former.

While providing background detail of the creation of SGML, XMLand the TEI , Renear also took time to clarify confusingterminology in the field. SGML is not a markup language in thetraditional sense, but it actually is a meta-language: a languagethat provides all the basic elements for authors to build their ownmarkup languages. SGML provided a means by which other specific???grammars??? could be built for any range of documents. Reneardiscusses how SGML finds use beyond the formatting of documents--itcan also be useful for data interchange.

In the end Renear acknowledges how SGML???s adoption graduallysuffered, first in the publishing world when WYSIWIG Word programsappeared which provided an even richer visual metaphor for textprocessing beyond macros and then second with the creation of HTML.HTML, even though it lacked an initial Document Type Definition,and had, in the words of Renear, ???an impoverished element set???,was a much simpler and forgiving markup language. Curiously, HTMLalso included both descriptive and procedural markup in its syntax??? a mix of different text encoding approaches. It is quitepossible that the very structure of WYSIWIG text processing withits visual metaphors and the non-linear nature of hypertext in HTMLchallenged, for Renear, the descriptive OHCO model of textstructure. Renear argues that a newer encompassing theory of textstructure is needed to both understand and create new techniquesfor text encoding.

Personal tools