2020
DOI: 10.2478/dim-2020-0003
|View full text |Cite
|
Sign up to set email alerts
|

Teaching Natural Language Processing through Big Data Text Summarization with Problem-Based Learning

Abstract: Natural language processing (NLP) covers a large number of topics and tasks related to data and information management, leading to a complex and challenging teaching process. Meanwhile, problem-based learning is a teaching technique specifically designed to motivate students to learn efficiently, work collaboratively, and communicate effectively. With this aim, we developed a problem-based learning course for both undergraduate and graduate students to teach NLP. We provided student teams with big data sets, b… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
9
0
1

Year Published

2022
2022
2023
2023

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 14 publications
(10 citation statements)
references
References 43 publications
0
9
0
1
Order By: Relevance
“…Big data also symbolize the competitive ability of national science and technology. It promotes the decision-making mode change, proving that data determine success or failure [ 18 , 19 ]. The application of big data increases the number of information dissemination carriers and provides users with more options and data potential in the practical application process.…”
Section: Methodsmentioning
confidence: 99%
“…Big data also symbolize the competitive ability of national science and technology. It promotes the decision-making mode change, proving that data determine success or failure [ 18 , 19 ]. The application of big data increases the number of information dissemination carriers and provides users with more options and data potential in the practical application process.…”
Section: Methodsmentioning
confidence: 99%
“…Chinese Word Embedding. Different from the English language where words are usually taken as basic semantic units, Chinese words have complicated composition structures revealing their semantic meanings (Li et al, 2020(Li et al, , 2021. More specifically, a Chinese word is often composed of several characters, and most of the characters themselves can be further divided into components such as radicals.…”
Section: Related Workmentioning
confidence: 99%
“…Generative language models are getting bigger: from ELMo's release in 2018 with 94M parameters (Joshi et al, 2018) to Megatron-Turing NLG in 2022 with 530Bn (Smith et al, 2022), there has been approximately a tenfold annual increase in parameters. The growing capabilities of these models have supported their adoption in many downstream tasks, from text summarisation (Li et al, 2020) and weather reporting (Gatt and Krahmer, 2018) to writing code . However, there are various associated risks, such as privacy erosion, copyright infringement, environmental harms and negative stereotyping of social groups (Margoni, 2019;Feyisetan et al, 2020;Bommasani et al, 2021;Weidinger et al, 2021).…”
Section: Introductionmentioning
confidence: 99%
“…In addition, text processing techniques (such as regular expressions, text normalization, and edit distance) that may be used in primary or secondary school are also introduced. Similarly, Li et al [13] proposed a big data text summarization course for undergraduate and graduate students. In this course, the authors let their students write their own program codes that perform text summarization.…”
Section: Related Work a Text Processing Educationmentioning
confidence: 99%