|
PTQ INT8 via TFLiteConverter — encoder-decoder seq2seq model loses encoder context entirely after conversion
|
|
0
|
4
|
April 27, 2026
|
|
Custom batches in sentence-transformers for MultipleNegativesRankingLoss
|
|
0
|
19
|
April 27, 2026
|
|
CPU offloading error scenario
|
|
11
|
89
|
April 27, 2026
|
|
Gemma 3 12B: 4-bit Quantization failing/ignored in Transformers v5.1.0 (Gemma3ForConditionalGeneration)
|
|
11
|
274
|
April 24, 2026
|
|
Why am I facing this Error while running this code
|
|
1
|
24
|
April 23, 2026
|
|
What are the best tutorials to learn Transformers step by step?
|
|
2
|
101
|
April 20, 2026
|
|
LLM Course code errors
|
|
8
|
217
|
April 17, 2026
|
|
Independent researcher looking for technical feedback on a paper about a revision-capable language model
|
|
0
|
32
|
April 17, 2026
|
|
I developed an experimental Graph-Native Artificial Brain engine
|
|
3
|
42
|
April 16, 2026
|
|
Why this BERTScore has a high precision?
|
|
1
|
21
|
April 16, 2026
|
|
Fine-tuning Gemma-4-E2B on MacBook M3
|
|
4
|
279
|
April 14, 2026
|
|
Current State and Future of "Integer-Only" LLM Inference (Non-Floating Point)
|
|
1
|
72
|
April 14, 2026
|
|
Continous increase in Memory usage
|
|
17
|
2225
|
April 14, 2026
|
|
Peft 0.18.1 crashing when fine-tuning - Part 2
|
|
2
|
27
|
April 14, 2026
|
|
Peft 0.18.1 crashing when fine-tuning
|
|
4
|
115
|
April 13, 2026
|
|
[Guide] How I debugged T5 fine-tuning for a medical diagnosis task
|
|
1
|
38
|
April 11, 2026
|
|
Runtime Layer on modeling_utils.py (No Source Changes)
|
|
0
|
40
|
April 11, 2026
|
|
What happened to DeepSite 2.0
|
|
3
|
45
|
April 9, 2026
|
|
Deprecation of assistant_only_loss
|
|
3
|
82
|
April 8, 2026
|
|
Semantic matching in graph space without matrix computation and hallucinations and no GPU
|
|
0
|
26
|
April 6, 2026
|
|
How to decode CSM tokens into audio tensors for streaming
|
|
2
|
79
|
April 5, 2026
|
|
How to get list of downloaded models names?
|
|
7
|
5882
|
April 5, 2026
|
|
Webhook usecase
|
|
1
|
22
|
April 2, 2026
|
|
Spaces not working with zerogpu on the paid planm
|
|
2
|
15
|
April 1, 2026
|
|
Pipeline tutorial, summarization doesn't work
|
|
3
|
93
|
March 31, 2026
|
|
Transformer for asynchronous multi-stream image time-series with online prediction?
|
|
1
|
20
|
March 30, 2026
|
|
Found the fix for memory not being freed when switching models on Linux (it's not Python or PyTorch)
|
|
2
|
92
|
March 29, 2026
|
|
Wave Field LLM — O(n log n) attention via wave equation dynamics, within 5% of standard transformer
|
|
4
|
6044
|
March 29, 2026
|
|
Mes Spaces restent bloqués sur “Starting” malgré abonnement Pro et hébergement GPU
|
|
5
|
111
|
March 28, 2026
|
|
How do I get started with Hugging Face Transformers as a beginner?
|
|
0
|
52
|
March 27, 2026
|