Hi! Given how similar the Scandinavian languages are, I suggest we might get better performance by training on data from all of them. Just a suggestion. I made a project proposal here: Scandinavian RoBERTa. It uses RoBERTa rather than GPT-2, but I'm not too fussed about the model architecture, to be honest.