The Language of the Conquerors: Opening the Lost World of the Turkic Empires for Genealogical Research

dc.audienceLibrary Services to Multicultural Populations Section
dc.audienceLocal History and Genealogy Section
dc.contributor.authorJonathan McCollum
dc.coverage.spatialUnited States of America
dc.date.accessioned2025-08-24T13:19:16Z
dc.date.available2025-08-24T13:19:16Z
dc.date.issued2025-08-24
dc.description.abstractThe imperial records of the Turkic empires of the past several centuries, initially composed in such dead languages as Ottoman and Chagatai Turkish, remain opaque for the millions of people attempting to trace their ancestry into a past in which Turks ruled over much of Europe and Asia. While these empires have receded into the pages of history, their robust records remain in the hands of thousands of state archives, libraries, private repositories, and personal collections. Archivists and librarians struggle to properly catalog and index these orthographically complex Turkic collections. Likewise, researchers without proper training in these moribund and deceased languages overlook these rich resources. Aware of the potential of Turkic records for genealogical purposes, FamilySearch International has implemented an approach to make these records accessible to non-specialists. First, this paper documents the ongoing efforts to train teams of students to index Ottoman and Chagatai Turkish manuscripts. Given the sheer size and scope of these global collections, FamilySearch also leverages machine learning to develop handwritten text recognition to assist in the indexing of Turkic record collections. In sum, this paper proposes a strategy for making all historical Turkic records accessible and useable to local and global researchers and suggests a framework for approaching similar language problems that afflict libraries around the world. Keywords: Ottoman Turkish, Chagatai Turkish, Arabic Script, Hadwritten Text Recognition
dc.identifier.urihttps://2025.ifla.org/
dc.identifier.urihttps://repository.ifla.org/handle/20.500.14598/4425
dc.language.isoen
dc.publisherInternational Federation of Library Associations and Institutions (IFLA)
dc.relation.ispartofseriesWorld Library and Information Congress (WLIC) ; 2025 - Astana, Kazakhstan - Uniting Knowledge, Building the Future
dc.rights.holderJonathan McCollum
dc.rights.licenseCC BY 4.0
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/
dc.subjectAccessibility
dc.titleThe Language of the Conquerors: Opening the Lost World of the Turkic Empires for Genealogical Research
dc.typeArticle
dc.typeEvents Material
ifla.UnitSection::Library Services to Multicultural Populations Section
ifla.UnitSection::Local History and Genealogy Section
ifla.oPubId0

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
S1_2025_mccollum_en.pdf
Size:
270.23 KB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
2.28 KB
Format:
Item-specific license agreed upon to submission
Description: