- 1
- 2
- Source: Analyzed Layout and Text Object
A Legend (2024)
WWII From Space (2012)
2001: A Space Odyssey (1968)
Avengers: Infinity War (2018)
Mechamato Movie (2022)
Azor (2021)
No More Posts Available.
No more pages to load.
Analyzed Layout and Text Object (ALTO) is an open XML Schema developed by the EU-funded project called METAe.
The standard was initially developed for the description of text OCR and layout information of pages for digitized material. The goal was to describe the layout and text in a form to be able to reconstruct the original appearance based on the digitized information - similar to the approach of a lossless image saving operation.
ALTO is often used in combination with Metadata Encoding and Transmission Standard (METS) for the description of the whole digitized object and creation of references across the ALTO files, e.g. reading sequence description.
The standard is hosted by the Library of Congress since 2010 and maintained by the Editorial Board initialized at the same time.
In the time from the final version of the ALTO standard in June 2004 (version 1.0) ALTO was maintained by CCS CCS Content Conversion Specialists GmbH, Hamburg up to version 1.4.
Structure
An ALTO file consists of three major sections as children of the root
<Layout> section contains the content information. It is subdivided into
Software support
ABBYY FineReader
eScriptorium
Kitodo
Tesseract OCR
Transkribus
See also
Metadata Encoding and Transmission Standard (METS)
Dublin Core, an ISO metadata standard
Preservation Metadata: Implementation Strategies (PREMIS)
Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH)
hOCR
PAGE (XML)
References
External links
ALTO (Analyzed Layout and Text Object) standards on Library of Congress website
https://altoxml.github.io/ resp. https://github.com/altoxml ALTOxml on GitHub
More info about METS/ALTO by CCS GmbH
METS ALTO Introduction by CCS GmbH Archived 2014-09-04 at the Wayback Machine
XSLT-Transformations from and to ALTO