A New Annotation Method and Dataset for Layout Analysis of Long Documents
Published in Companion Proceedings of the ACM Web Conference 2023, 2023
Parsing long documents, such as books, theses, and dissertations, is an important component of information extraction from scholarly documents…. Read more
Recommended citation: Aman Ahuja, Kevin Dinh,Brian Dinh, William A. Ingram, and Edward Fox. 2023. A New Annotation Method and Dataset for Layout Analysis of Long Documents. In Companion Proceedings of the ACM Web Conference 2023 (Austin, TX, USA) (WWW ’23 Companion). Association for Computing Machinery, New York, NY, USA, 834—-842. https://doi.org/10.1145/3543873.3587609. https://dl.acm.org/doi/abs/10.1145/3543873.3587609