Fine-tuning BETO for emoji prediction
🤗 huggingface.co/ccarvajal/beto-emoji
Running the model requires PyTorch, whose installation depends on your system and on whether a GPU is available, as well as the transformers library. For the remaining dependencies, run
pip install -r requirements.txt
Training details and a usage example are shown at github.com/camilocarvajalreyes/beto-emoji. A deeper analysis of this and other models on the full dataset can be found at github.com/furrutiav/data-mining-2022. This model was used in a project for the CC5205 Data Mining course.
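A minimal inference sketch, assuming the checkpoint exposes the standard transformers sequence-classification head (the function name and example tweet below are illustrative, not from the repository):

```python
from transformers import AutoModelForSequenceClassification, AutoTokenizer
import torch

def predict_emoji(text, model_name="ccarvajal/beto-emoji"):
    """Return the index of the highest-scoring emoji class for a tweet.

    Sketch only: downloads the checkpoint on first call and assumes the
    standard sequence-classification head.
    """
    tokenizer = AutoTokenizer.from_pretrained(model_name)
    model = AutoModelForSequenceClassification.from_pretrained(model_name)
    inputs = tokenizer(text, return_tensors="pt", truncation=True)
    with torch.no_grad():
        logits = model(**inputs).logits
    return int(logits.argmax(dim=-1))

# Usage (requires network access to fetch the checkpoint):
# predict_emoji("que viva españa")
```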
The Multilingual Emoji Prediction dataset (Barbieri et al., 2018) consists of tweets in English and Spanish that originally contained a single emoji, which is then used as the label. The test and trial sets can be downloaded here, but the train set must be retrieved with a Twitter crawler. The goal is to predict, from the text alone, the emoji that originally appeared in the tweet, out of a fixed set of possible emojis: 20 for English and 19 for Spanish.
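Framed as single-label classification, prediction reduces to an argmax over the fixed emoji classes. A self-contained sketch (the label subset below is purely illustrative; the real index-to-emoji mapping ships with the dataset):

```python
import numpy as np

# Illustrative subset of emoji labels -- the real Spanish mapping has 19 classes.
LABELS = ["❤", "😍", "😂", "💕", "😊"]

def predict_label(logits):
    # The model emits one score per emoji class; prediction is the argmax.
    return LABELS[int(np.argmax(logits))]

scores = np.array([0.1, 2.3, 0.4, 1.1, 0.2])
print(predict_label(scores))  # → 😍
```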
Training parameters:
training_args = TrainingArguments(
    output_dir="./results",
    learning_rate=2e-5,
    per_device_train_batch_size=16,
    per_device_eval_batch_size=16,
    num_train_epochs=5,
    weight_decay=0.01,
)
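These arguments are then passed to a Trainer. A hedged wiring sketch, assuming BETO's public base checkpoint (dccuchile/bert-base-spanish-wwm-cased) and already-tokenised datasets (the function name and dataset variables are assumptions, not from the repository):

```python
from transformers import (
    AutoModelForSequenceClassification,
    Trainer,
    TrainingArguments,
)

def build_trainer(train_dataset, eval_dataset, num_labels=19):
    """Wire the training arguments above into a Trainer.

    Sketch only: the base checkpoint name and dataset arguments are
    assumptions; 19 labels corresponds to the Spanish emoji set.
    """
    model = AutoModelForSequenceClassification.from_pretrained(
        "dccuchile/bert-base-spanish-wwm-cased", num_labels=num_labels
    )
    training_args = TrainingArguments(
        output_dir="./results",
        learning_rate=2e-5,
        per_device_train_batch_size=16,
        per_device_eval_batch_size=16,
        num_train_epochs=5,
        weight_decay=0.01,
    )
    return Trainer(
        model=model,
        args=training_args,
        train_dataset=train_dataset,
        eval_dataset=eval_dataset,
    )

# Usage (requires network access and tokenised datasets):
# trainer = build_trainer(train_ds, eval_ds)
# trainer.train()
```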