
Code for Identifying Incorrect Labels #42

Open
GSidiropoulos opened this issue Sep 21, 2022 · 3 comments


@GSidiropoulos

Dear authors, thank you for sharing your work. I was wondering if you could also provide the code for identifying the incorrect labels. From what I understand, the label corrections used to produce a corrected version of the corpus are given; however, the code to reproduce them is not available.

Best regards,
Georgios

@xuhdev
Collaborator

xuhdev commented Sep 21, 2022

We identified the incorrect labels manually, though we used some tools to speed up the human review. I believe @frreiss has those tools available, but I don't think they changed the manual nature of identifying labels in any way.

@GSidiropoulos
Author

Thank you for your reply. My question mostly refers to the code you have here: https://github.com/CODAIT/text-extensions-for-pandas/tree/master/tutorials/corpus. How can I obtain the 1054 flagged training samples? Do I have to combine the results from the CoNLL_4.ipynb and CoNLL_3.ipynb notebooks?

@GSidiropoulos
Author

GSidiropoulos commented Nov 29, 2022

Can you please clarify how we are supposed to combine the results of the CoNLL_2.ipynb, CoNLL_3.ipynb, and CoNLL_4.ipynb notebooks in order to obtain the results you report in the paper? In the paper you mention that "We considered any label where fewer than 7 models agreed with the corpus label to be 'flagged'". Is this the only condition you use in order to flag labels?
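For reference, a minimal sketch of my reading of the quoted criterion: a token's corpus label is flagged when fewer than 7 of the ensemble's models predict that same label. This is not the authors' actual code; the column names, the 8-model toy ensemble, and the `flag_labels` helper are all illustrative assumptions, not the schema used in the CoNLL_*.ipynb notebooks.

```python
import pandas as pd

# Agreement cutoff quoted from the paper: fewer than 7 agreeing models
# means the corpus label gets flagged for human review.
AGREEMENT_THRESHOLD = 7

def flag_labels(df: pd.DataFrame, model_cols: list) -> pd.DataFrame:
    """Add 'num_agree' and 'flagged' columns (hypothetical helper).

    `df` is assumed to have a 'corpus_label' column plus one column of
    predicted labels per model, listed in `model_cols`.
    """
    df = df.copy()
    # Compare each model's prediction column against the corpus label,
    # row by row, and count how many models agree per token.
    df["num_agree"] = df[model_cols].eq(df["corpus_label"], axis=0).sum(axis=1)
    df["flagged"] = df["num_agree"] < AGREEMENT_THRESHOLD
    return df

# Toy example: 3 tokens, 8 models. Only the middle token (5 of 8 agree)
# falls below the threshold.
models = [f"model_{i}" for i in range(8)]
data = {"corpus_label": ["B-PER", "O", "B-LOC"]}
for i, m in enumerate(models):
    data[m] = ["B-PER", "O" if i < 5 else "B-ORG", "B-LOC"]
df = flag_labels(pd.DataFrame(data), models)
print(df[["corpus_label", "num_agree", "flagged"]])
```

Under this reading, combining the notebooks would amount to collecting the per-model predictions into one table like the above and applying the threshold, but whether additional conditions were applied is exactly the open question in this thread.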
