PreTrain GPT2 from scratch in Swedish

saattrupdan · June 28, 2021, 9:35pm

Hi! Considering the similarity between the Scandinavian languages, I suggest we might achieve a higher performance by utilising data from all of the languages. Just a suggestion. I made a project proposal here: Scandinavian RoBERTa. It’s using RoBERTa and not GPT-2 however, but I’m not too fuzzed about the model architecture, to be honest.

Topic		Replies	Views
PreTrain GPT2 from scratch in Russian Flax/JAX Projects	1	712	July 1, 2021
PreTrain GPT2-Large (and/or GPT2-XL) from scratch in Portuguese Flax/JAX Projects	0	769	June 24, 2021
PreTrain GPT2 from scratch in Spanish Flax/JAX Projects	12	2070	July 1, 2021
Pretrain GPT-2 from scratch in Thai Flax/JAX Projects	0	960	July 18, 2021
PreTrain GPT2 from scratch in Bengali Flax/JAX Projects	8	2519	August 19, 2021

PreTrain GPT2 from scratch in Swedish

Related topics