2017
DOI: 10.1016/j.softx.2017.04.004
|View full text |Cite
|
Sign up to set email alerts
|

RCrawler: An R package for parallel web crawling and scraping

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
23
0
7

Year Published

2018
2018
2022
2022

Publication Types

Select...
6
2
2

Relationship

0
10

Authors

Journals

citations
Cited by 53 publications
(31 citation statements)
references
References 6 publications
0
23
0
7
Order By: Relevance
“…Estes bots, fazem rastreamento e raspagem de dados, utilizando duas técnicas, a saber: web crawling e web scraping. Estas técnicas podem ser usadas de forma simultânea ou como duas tarefas distintas [Khalil and Fakir 2017].…”
Section: Bots Da Webunclassified
“…Estes bots, fazem rastreamento e raspagem de dados, utilizando duas técnicas, a saber: web crawling e web scraping. Estas técnicas podem ser usadas de forma simultânea ou como duas tarefas distintas [Khalil and Fakir 2017].…”
Section: Bots Da Webunclassified
“…Web crawling is the most common technique to gather numerous online reviews in the text or picture format on social media [24]. Although crawlers can scrape the target dataset conveniently and continuously from any websites, they consume resources of visited systems and will cause the load and schedule issue [25]. To improve the data extraction efficiency, several crawling tools could be installed and utilized directly, such as Scrapy (scrapy.org), python-requests (docs.python-requests.org), and bazhuayu (www.bazhuayu.com).…”
Section: Social Media Miningmentioning
confidence: 99%
“…There are various data sources on the internet that can be used in the research process. The process of taking data or information from sites on the internet is called web scraping [2], [3], [4], [5], [6], [7], web extraction [8], [9], [10], [11], web harvesting [12], [13]. Web scraping has been used widely and for different purposes including online price comparison, weather data monitoring, website change detection, research, integrating data from multiple sources, extract offers and discounts, scrape job postings information from job portals, brand monitoring, collect government data and market analysis [14].…”
Section: Introductionmentioning
confidence: 99%