Special tokens seem to be considered atomic. However, the implementation of special tokens is quite complex (it has been revised and changed over a long period of time), so it would be safer to search for information while working on it.
John6666
2
Related topics
| Topic | Replies | Views | Activity | |
|---|---|---|---|---|
| Are special_tokens the only tokens guaranteed to be atomic? | 0 | 415 | March 3, 2021 | |
| Training a tokenizer clarification question | 3 | 61 | December 4, 2025 | |
| Use a pretrained ByteLevelBPETokenizer on text | 1 | 4093 | July 17, 2020 | |
| Tokenizer is splitting special token | 3 | 77 | June 30, 2025 | |
| Tokenizer splits up pre-split tokens | 9 | 6867 | February 9, 2024 |