This entry contains three benchmark datasets released as part of the scholarly output of an ICDAR 2021 paper:

Meng Ling, Jian Chen, Torsten Möller, Petra Isenberg, Tobias Isenberg, Michael Sedlmair, Robert S. Laramee, Han-Wei Shen, Jian Wu, and C. Lee Giles, Document Domain Randomization for Deep Learning Document Layout Extraction, 16th International Conference on Document Analysis and Recognition (ICDAR) 2021. September 5-10, Lausanne, Switzerland. 

This dataset contains nine class labels: abstract, algorithm, author, body text, caption, equation, figure, table, and title.
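
For readers who want to use these labels in code, the sketch below shows one way to map the nine class names to integer ids. Only the class names come from the description above; the ordering, the integer ids, and the names `CLASS_LABELS`, `LABEL_TO_ID`, `ID_TO_LABEL`, and `encode` are assumptions made for illustration and may not match the encoding used in the dataset files.

```python
# Hypothetical label map: the class names come from the dataset description,
# but the integer ids and their order are assumptions for illustration only.
CLASS_LABELS = [
    "abstract", "algorithm", "author", "body text", "caption",
    "equation", "figure", "table", "title",
]

# Forward and reverse lookups between class names and integer ids.
LABEL_TO_ID = {name: idx for idx, name in enumerate(CLASS_LABELS)}
ID_TO_LABEL = {idx: name for name, idx in LABEL_TO_ID.items()}


def encode(label: str) -> int:
    """Return the integer id for a class name, ignoring case and outer spaces."""
    return LABEL_TO_ID[label.strip().lower()]
```

An unknown class name raises a `KeyError` in `encode`, which is usually preferable to silently assigning a default class when checking annotations.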

[1] Meng Ling, Jian Chen, Torsten Möller, Petra Isenberg, Tobias Isenberg, Michael Sedlmair, Robert S. Laramee, Han-Wei Shen, Jian Wu, C. Lee Giles, "Three Benchmark Datasets for Scholarly Article Layout Analysis", IEEE Dataport, 2021. [Online]. Available: http://dx.doi.org/10.21227/326q-bf39. Accessed: Feb. 10, 2025.
@data{326q-bf39-21,
doi = {10.21227/326q-bf39},
url = {http://dx.doi.org/10.21227/326q-bf39},
author = {Meng Ling and Jian Chen and Torsten Möller and Petra Isenberg and Tobias Isenberg and Michael Sedlmair and Robert S. Laramee and Han-Wei Shen and Jian Wu and C. Lee Giles},
publisher = {IEEE Dataport},
title = {Three Benchmark Datasets for Scholarly Article Layout Analysis},
year = {2021} }