Adaptive information extraction from unstructured documents

Dezsenyi, Csaba; Dobrowiecki, Tadeusz; Mészáros, Tamás

doi:10.1504/ijiids.2007.014948

Cited by 3 publications

(3 citation statements)

References 20 publications

(18 reference statements)

“…Natural language processing (NLP) is a constant reference in most publications (Hassan and Le 2020). Sometimes their proposals ask for structured documents and, when not, they need to transform documents into structured data (Dezsenyi et al 2007; Oro and Ruffolo 2008). Other times they need to convert the original PDF files into HTML and text format files to be able to proceed ( The documents analyzed propose algorithm-based systems and agents with rules to query document databases, although it is common to find unsolved problems when there are heterogeneous data sources (Seng and Lai 2010).…”

Section: Literature Review

mentioning

confidence: 99%

Intelligent information extraction from scholarly document databases

Fernandez

2020

JISIB

Extracting knowledge from big document databases has long been a challenge. Most researchers do a literature review and manage their document databases with tools that just provide a bibliography and when retrieving information (a list of concepts and ideas), there is a severe lack of functionality. Researchers do need to extract specific information from their scholarly document databases depending on their predefined breakdown structure. Those databases usually contain a few hundred documents, information requirements are distinct in each research project, and technique algorithms are not always the answer. As most retrieving and information extraction algorithms require manual training, supervision, and tuning, it could be shorter and more efficient to do it by hand and dedicate time and effort to perform an effective semantic search list definition that is the key to obtain the desired results. A robust relative importance index definition is the final step to obtain a ranked importance concept list that will be helpful both to measure trends and to find a quick path to the most appropriate paper in each case.

Section: Literature Review

mentioning

confidence: 99%

“…x x x x x x x x x x Journal Article Dezsenyi, C., Dobrowiecki, T. P., and Meszaros, T. 2007 Adaptive information extraction from unstructured documents System. Transformation of documents into structured form.…”

unclassified

Intelligent information extraction from scholarly document databases

Fernandez

2020

JISIB

“…Natural language processing (NLP) is a constant reference in most publications (Hassan and Le 2020). Sometimes their proposals ask for structured documents and, when not, they need to transform documents into structured data (Dezsenyi et al 2007; Oro and Ruffolo 2008). Other times they need to convert the original PDF files into HTML and text format files to be able to proceed (Hassan and Baumgartner 2005a; Rizvi et al 2018; Seng and Lai 2010).…”

Section: Literature Review

mentioning

confidence: 99%

Untitled

2020

JISIB

The journal includes articles within areas such as Competitive Intelligence, Business Intelligence, Market Intelligence, Scientific and Technical Intelligence and Geo-economics. This means that the journal has a managerial as well as an applied technical side (Information Systems), as these are now well integrated in real life Business Intelligence solutions. By focusing on business applications, this journal does not compete directly with the journals that deal with library sciences or state and military intelligence studies. Topics within the selected study areas should show clear practical implications.

Incremental semantic web retrieval model based on web service

Liao

Guangping

2017

International Journal of Computers and Applications
