Automatic extraction of data from 2-D plots in documents

Xiaonan Lu, James Z. Wang, Prasenjit Mitra, C. Lee Giles

Research output: Chapter in Book/Report/Conference proceedingConference contribution

10 Citations (Scopus)

Abstract

Two-dimensional (2-D) plots in digital documents contain important information. Often, the results of scientific experiments and performance of businesses are summarized using plots. Although 2-D plots are easily understood by human users, current search engines rarely utilize the information contained in the plots to enhance the results returned in response to queries posed by endusers. We propose an automated algorithm for extracting information from line curves in 2-D plots. The extracted information can be stored in a database and indexed to answer end-user queries and enhance search results. We have collected 2-D plot images from a variety of resources and tested our extraction algorithms. Experimental evaluation has demonstrated that our method can produce results suitable for real world use.

Original languageEnglish
Title of host publicationProceedings of the International Conference on Document Analysis and Recognition, ICDAR
Pages188-192
Number of pages5
Volume1
DOIs
Publication statusPublished - 2007
Externally publishedYes
Event9th International Conference on Document Analysis and Recognition, ICDAR 2007 - Curitiba
Duration: 23 Sep 200726 Sep 2007

Other

Other9th International Conference on Document Analysis and Recognition, ICDAR 2007
CityCuritiba
Period23/9/0726/9/07

Fingerprint

Search engines
Industry
Experiments

ASJC Scopus subject areas

  • Computer Vision and Pattern Recognition

Cite this

Lu, X., Wang, J. Z., Mitra, P., & Giles, C. L. (2007). Automatic extraction of data from 2-D plots in documents. In Proceedings of the International Conference on Document Analysis and Recognition, ICDAR (Vol. 1, pp. 188-192). [4378701] https://doi.org/10.1109/ICDAR.2007.4378701

Automatic extraction of data from 2-D plots in documents. / Lu, Xiaonan; Wang, James Z.; Mitra, Prasenjit; Giles, C. Lee.

Proceedings of the International Conference on Document Analysis and Recognition, ICDAR. Vol. 1 2007. p. 188-192 4378701.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Lu, X, Wang, JZ, Mitra, P & Giles, CL 2007, Automatic extraction of data from 2-D plots in documents. in Proceedings of the International Conference on Document Analysis and Recognition, ICDAR. vol. 1, 4378701, pp. 188-192, 9th International Conference on Document Analysis and Recognition, ICDAR 2007, Curitiba, 23/9/07. https://doi.org/10.1109/ICDAR.2007.4378701
Lu X, Wang JZ, Mitra P, Giles CL. Automatic extraction of data from 2-D plots in documents. In Proceedings of the International Conference on Document Analysis and Recognition, ICDAR. Vol. 1. 2007. p. 188-192. 4378701 https://doi.org/10.1109/ICDAR.2007.4378701
Lu, Xiaonan ; Wang, James Z. ; Mitra, Prasenjit ; Giles, C. Lee. / Automatic extraction of data from 2-D plots in documents. Proceedings of the International Conference on Document Analysis and Recognition, ICDAR. Vol. 1 2007. pp. 188-192
@inproceedings{87d573042d4a43deb2dbd64e2fd2a12e,
title = "Automatic extraction of data from 2-D plots in documents",
abstract = "Two-dimensional (2-D) plots in digital documents contain important information. Often, the results of scientific experiments and performance of businesses are summarized using plots. Although 2-D plots are easily understood by human users, current search engines rarely utilize the information contained in the plots to enhance the results returned in response to queries posed by endusers. We propose an automated algorithm for extracting information from line curves in 2-D plots. The extracted information can be stored in a database and indexed to answer end-user queries and enhance search results. We have collected 2-D plot images from a variety of resources and tested our extraction algorithms. Experimental evaluation has demonstrated that our method can produce results suitable for real world use.",
author = "Xiaonan Lu and Wang, {James Z.} and Prasenjit Mitra and Giles, {C. Lee}",
year = "2007",
doi = "10.1109/ICDAR.2007.4378701",
language = "English",
isbn = "0769528228",
volume = "1",
pages = "188--192",
booktitle = "Proceedings of the International Conference on Document Analysis and Recognition, ICDAR",

}

TY - GEN

T1 - Automatic extraction of data from 2-D plots in documents

AU - Lu, Xiaonan

AU - Wang, James Z.

AU - Mitra, Prasenjit

AU - Giles, C. Lee

PY - 2007

Y1 - 2007

N2 - Two-dimensional (2-D) plots in digital documents contain important information. Often, the results of scientific experiments and performance of businesses are summarized using plots. Although 2-D plots are easily understood by human users, current search engines rarely utilize the information contained in the plots to enhance the results returned in response to queries posed by endusers. We propose an automated algorithm for extracting information from line curves in 2-D plots. The extracted information can be stored in a database and indexed to answer end-user queries and enhance search results. We have collected 2-D plot images from a variety of resources and tested our extraction algorithms. Experimental evaluation has demonstrated that our method can produce results suitable for real world use.

AB - Two-dimensional (2-D) plots in digital documents contain important information. Often, the results of scientific experiments and performance of businesses are summarized using plots. Although 2-D plots are easily understood by human users, current search engines rarely utilize the information contained in the plots to enhance the results returned in response to queries posed by endusers. We propose an automated algorithm for extracting information from line curves in 2-D plots. The extracted information can be stored in a database and indexed to answer end-user queries and enhance search results. We have collected 2-D plot images from a variety of resources and tested our extraction algorithms. Experimental evaluation has demonstrated that our method can produce results suitable for real world use.

UR - http://www.scopus.com/inward/record.url?scp=51149119523&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=51149119523&partnerID=8YFLogxK

U2 - 10.1109/ICDAR.2007.4378701

DO - 10.1109/ICDAR.2007.4378701

M3 - Conference contribution

SN - 0769528228

SN - 9780769528229

VL - 1

SP - 188

EP - 192

BT - Proceedings of the International Conference on Document Analysis and Recognition, ICDAR

ER -