Hugging Face Forums
UTF-16 for datasets?
🤗Datasets
mariosasko
June 19, 2023, 7:16pm
You can pass `encoding="utf-16"` to the `load_dataset` call.
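For illustration, here is a minimal sketch of what that could look like, assuming the file is plain text loaded with the generic `text` builder (the CSV and JSON builders accept an `encoding` argument in the same way). The sample file, its contents, and the helper name `load_utf16_text` are hypothetical and only there to make the example self-contained.

```python
import os
import tempfile

# Create a small UTF-16 encoded text file to load (hypothetical sample data).
tmp_dir = tempfile.mkdtemp()
path = os.path.join(tmp_dir, "sample_utf16.txt")
with open(path, "w", encoding="utf-16") as f:
    f.write("héllo\nwörld\n")


def load_utf16_text(file_path):
    """Hypothetical helper: load a UTF-16 text file as a Dataset."""
    # Imported here so the file-writing part above runs without the library.
    from datasets import load_dataset  # requires `pip install datasets`

    # `encoding` is forwarded to the builder; it defaults to "utf-8",
    # which is why UTF-16 files fail to decode unless it is overridden.
    return load_dataset(
        "text",
        data_files=file_path,
        encoding="utf-16",
        split="train",
    )
```

Calling `load_utf16_text(path)` should then return a dataset with one row per line of the file in its `text` column.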