Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery &Amp; Data Mining 2020
DOI: 10.1145/3394486.3403303
|View full text |Cite
|
Sign up to set email alerts
|

Automatic Validation of Textual Attribute Values in E-commerce Catalog by Learning with Limited Labeled Data

Abstract: Product catalogs are valuable resources for eCommerce website. In the catalog, a product is associated with multiple attributes whose values are short texts, such as product name, brand, functionality and flavor. Usually individual retailers self-report these key values, * Most of the work was conducted when the author was interning at Amazon.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
3
2

Relationship

3
2

Authors

Journals

citations
Cited by 10 publications
(3 citation statements)
references
References 21 publications
(22 reference statements)
0
3
0
Order By: Relevance
“…Meta-learning has long been proposed as a form of learning that would allow systems to systematically build up and re-use knowledge across different but related tasks [34,36,37]. MAML [6] is to learn model initialization parameters that are used to rapidly learn novel tasks with a small set of labeled data.…”
Section: Few-shot Learningmentioning
confidence: 99%
“…Meta-learning has long been proposed as a form of learning that would allow systems to systematically build up and re-use knowledge across different but related tasks [34,36,37]. MAML [6] is to learn model initialization parameters that are used to rapidly learn novel tasks with a small set of labeled data.…”
Section: Few-shot Learningmentioning
confidence: 99%
“…The MAG team used machine curation to solve the problem of uniquely identifying authors and their publications (Wang et.al 2020). A human curation strategy advocates setting up standards such as Document Object Identifier (DOI) for uniquely identifying publications, and Open Researcher and Contributor ID (ORCID) for uniquely identifying authors.…”
Section: Contrasting Perspectivesmentioning
confidence: 99%
“…The next level of scaling aimed to reduce modeling cost through AutoML and automatic cleaning techniques (Wang et al. 2020) so that manual tuning for each knowledge extraction model could be significantly reduced. Scaling further required reducing the total number of models required for the variety of knowledge to be extracted, which was achieved through transfer learning techniques such that a model can extract knowledge for multiple attributes and for multiple domains (Karamanolakis, Ma, and Dong 2020).…”
Section: Contrasting Perspectivesmentioning
confidence: 99%