Text Transcription Issues: Difference between revisions

From iDigBio
Jump to navigation Jump to search
mNo edit summary
mNo edit summary
Line 1: Line 1:
== About Standards for Transcribing Text  ==
== About Standards for Transcribing Text  ==


*In our last meeting we discussed some of the challenges of transcribing text with corrections, alterations, strikeouts, ambiguous letters, etc and I briefly mentioned some transcription projects that have dealt with similar issues. A hackathon participant, Ben Brumfeld, has much more experience in this topic so first I'll point you to some information he has compiled. His blog home page (http://manuscripttranscription.blogspot.com) currently has a transcription of his talk about the variety of formats that various projects are using. A worthwhile read.  
*In our last meeting (18 Dec 2012) we discussed some of the challenges of transcribing text with corrections, alterations, strikeouts, ambiguous letters, etc and I briefly mentioned some transcription projects that have dealt with similar issues. A hackathon participant, Ben Brumfeld, has much more experience in this topic so first I'll point you to some information he has compiled. His blog home page (http://manuscripttranscription.blogspot.com) currently has a transcription of his talk about the variety of formats that various projects are using. A worthwhile read.
 
*If we decide to try to transcribe or preserve ambiguous or corrected/struckout characters, then the Text Encoding Initiative format might be a good start, though it would require the use of XML elements in brackets. A more lightweight approach might be to utilize some of the wiki markup formats like Markdown (http://daringfireball.net/projects/markdown/syntax) or Textile (http://txstyle.org).  
*If we decide to try to transcribe or preserve ambiguous or corrected/struckout characters, then the Text Encoding Initiative format might be a good start, though it would require the use of XML elements in brackets. A more lightweight approach might be to utilize some of the wiki markup formats like Markdown (http://daringfireball.net/projects/markdown/syntax) or Textile (http://txstyle.org).  
*Below I've listed some projects that either establish transcription or markup standards or have published guidelines or suggestions about how to transcribe text.  
*Below I've listed some projects that either establish transcription or markup standards or have published guidelines or suggestions about how to transcribe text.
 
**TEI elements for representing primary documents (in particular, errors, corrections, alterations, ambiguity, etc) - http://www.tei-c.org/release/doc/tei-p5-doc/en/html/PH.html#PHCH  
**TEI elements for representing primary documents (in particular, errors, corrections, alterations, ambiguity, etc) - http://www.tei-c.org/release/doc/tei-p5-doc/en/html/PH.html#PHCH  
**FreeReg, register transcription - http://www.freereg.org.uk/howto/transcribe.htm  
**FreeReg, register transcription - http://www.freereg.org.uk/howto/transcribe.htm  

Revision as of 18:37, 19 December 2012

About Standards for Transcribing Text

  • In our last meeting (18 Dec 2012) we discussed some of the challenges of transcribing text with corrections, alterations, strikeouts, ambiguous letters, etc and I briefly mentioned some transcription projects that have dealt with similar issues. A hackathon participant, Ben Brumfeld, has much more experience in this topic so first I'll point you to some information he has compiled. His blog home page (http://manuscripttranscription.blogspot.com) currently has a transcription of his talk about the variety of formats that various projects are using. A worthwhile read.
  • If we decide to try to transcribe or preserve ambiguous or corrected/struckout characters, then the Text Encoding Initiative format might be a good start, though it would require the use of XML elements in brackets. A more lightweight approach might be to utilize some of the wiki markup formats like Markdown (http://daringfireball.net/projects/markdown/syntax) or Textile (http://txstyle.org).
  • Below I've listed some projects that either establish transcription or markup standards or have published guidelines or suggestions about how to transcribe text.