Work With Other Datasets #4
Comments
I have the same question as antmarakis. Could you kindly help?
Just sharing my experience with this repo, in case it helps someone in the future: I used it for a {TEXT, LABEL} task with BERT models. Since the training section of this repo supports neither this type of task nor this type of model, I would recommend first training your model on your dataset on your own (without using the code of this repo). While training, save the logits of each data instance together with the gold-standard label and a unique identifier, as suggested by the authors (see the "Note:" section). After training, you can use train_dy_filtering, as explained here, to generate DataMaps and to obtain coordinates for further data filtering. You just need to extend this line of code with an additional name, which you then use as the task name. Then you can call
Fixing the errors in this repo is more time-consuming than extracting training dynamics from a model that is trained independently :)
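For anyone trying the "train independently and log the logits" route described in the comments above, here is a minimal sketch of the logging step, using only the standard library. The file name pattern (`dynamics_epoch_N.jsonl`) and the record keys (`guid`, `logits_epoch_N`, `gold`) are assumptions on my part; please check them against the authors' "Note:" section before relying on them. The helper names (`save_epoch_dynamics`, `confidence_and_variability`) are hypothetical and are not part of this repo. The second function is only an illustration of how data-map coordinates can be derived from the logged logits (mean and standard deviation of the gold-label probability across epochs); it is not the repo's own filtering code.

```python
import json
import math
import os
from collections import defaultdict


def save_epoch_dynamics(out_dir, epoch, guids, logits, golds):
    """Write one JSON line per training instance for a single epoch.

    Assumed record layout: {"guid": ..., "logits_epoch_<N>": [...], "gold": ...}
    """
    os.makedirs(out_dir, exist_ok=True)
    path = os.path.join(out_dir, f"dynamics_epoch_{epoch}.jsonl")
    with open(path, "w", encoding="utf-8") as f:
        for guid, row, gold in zip(guids, logits, golds):
            record = {
                "guid": guid,
                f"logits_epoch_{epoch}": [float(x) for x in row],
                "gold": int(gold),
            }
            f.write(json.dumps(record) + "\n")
    return path


def softmax(row):
    """Numerically stable softmax over a list of logits."""
    m = max(row)
    exps = [math.exp(x - m) for x in row]
    total = sum(exps)
    return [e / total for e in exps]


def confidence_and_variability(out_dir, num_epochs):
    """Return {guid: (confidence, variability)} from the logged files.

    confidence  = mean probability of the gold label across epochs
    variability = standard deviation of that probability across epochs
    """
    gold_probs = defaultdict(list)
    for epoch in range(num_epochs):
        path = os.path.join(out_dir, f"dynamics_epoch_{epoch}.jsonl")
        with open(path, encoding="utf-8") as f:
            for line in f:
                rec = json.loads(line)
                probs = softmax(rec[f"logits_epoch_{epoch}"])
                gold_probs[rec["guid"]].append(probs[rec["gold"]])

    coords = {}
    for guid, ps in gold_probs.items():
        mean = sum(ps) / len(ps)
        var = sum((p - mean) ** 2 for p in ps) / len(ps)
        coords[guid] = (mean, math.sqrt(var))
    return coords
```

In a training loop you would call `save_epoch_dynamics` once per epoch with the logits the model produced for each training instance during that epoch, converting tensors to plain lists first (for example with `.tolist()` in PyTorch).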
Hi! This looks like a very interesting tool, and I am wondering whether it would be easy to use on other datasets. I see that only GLUE/NLI datasets are supported. Do you have any tips on how to use this on a simple {TEXT, LABEL} task? Thanks!