2019
DOI: 10.1016/j.dib.2018.12.063

PADI-web corpus: Labeled textual data in animal health domain

Abstract: Monitoring animal health worldwide, especially the early detection of outbreaks of emerging pathogens, is one of the means of preventing the introduction of infectious diseases in countries (Collier et al., 2008) [3]. In this context, we developed PADI-web, a Platform for Automated extraction of animal Disease Information from the Web (Arsevska et al., 2016, 2018). PADI-web is a text-mining tool that automatically detects, categorizes and extracts disease outbreak information from Web news articles. PADI-web c…

citations
Cited by 8 publications
(4 citation statements)
references
References 3 publications
“…We found 7 end‐user tools and 76 published methods papers. For one tool (PADI‐web), we aggregated two full texts into one tool description 29,30. Those tools automate data extraction across different types of text (i.e., patents, news, trial registrations) and across different media (i.e., digitalised text and videos).…”
Section: Results (mentioning)
confidence: 99%
“…For one tool (PADI-web), we aggregated two full texts into one tool description. 29,30 Those tools automate data extraction across different types of text (i.e., patents, news, trial registrations) and across different media (i.e., digitalised text and videos). For PADI-web, we found accessible web-deployments, giving users the opportunity to test and use the tool.…”
Section: Tool Descriptions (mentioning)
confidence: 99%
“…could be used (see Section 2.2). We have used a labeled dataset [41] to learn and integrate a domain-specific model, which is able to detect host and disease names, as well as numbers of cases related to an outbreak. Both types of entities (i.e.…”
Section: Methods (mentioning)
confidence: 99%
“…The DK-Clarin [12] and PADI-web corpus [13] have relatively small amounts of agricultural data. DK-Clarin has 45,000 documents, but the corpus covers other domains than agriculture.…”
Section: Corpus (mentioning)
confidence: 99%