Proceedings of the 32nd ACM SIGSOFT International Symposium on Software Testing and Analysis 2023
DOI: 10.1145/3597926.3598060
|View full text |Cite
|
Sign up to set email alerts
|

Who Judges the Judge: An Empirical Study on Online Judge Tests

Abstract: Online Judge platforms play a pivotal role in education, competitive programming, recruitment, career training, and large language model training. They rely on predefined test suites to judge the correctness of submitted solutions. It is therefore important that the solution judgement is reliable and free from potentially misleading false positives (i.e., incorrect solutions that are judged as correct). In this paper, we conduct an empirical study of 939 coding problems with 541,552 solutions, all of which are… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
2

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
references
References 57 publications
0
0
0
Order By: Relevance