PeytonT/paper_universe_interactive
Viewer • Updated • 1.25M • 336
How to use PeytonT/1m-paper-embedding-model-lite-onnx with sentence-transformers:
from sentence_transformers import SentenceTransformer
model = SentenceTransformer("PeytonT/1m-paper-embedding-model-lite-onnx")
sentences = [
"The weather is lovely today.",
"It's so sunny outside!",
"He drove to the stadium."
]
embeddings = model.encode(sentences)
similarities = model.similarity(embeddings, embeddings)
print(similarities.shape)
# [3, 3]This is a compact student encoder distilled from PeytonT/1m-paper-embedding-model over the 1M paper corpus.
It predicts the same 768-dimensional, L2-normalized M1 embedding space used by the Research Library paper universe.
| Path | Description |
|---|---|
onnx/model.onnx |
Float ONNX student encoder. |
onnx/model.int8.onnx |
Dynamically quantized int8 ONNX model for browser/WASM inference. |
tokenizer/ |
Student tokenizer files. |
manifest.json |
Export metadata consumed by the static viewer. |
exports/huggingface/paper_universe_interactive_v1/semantic_m1/papers_all.emb.i8exports/huggingface/paper_universe_interactive_v1/interactive/papers_all.json1,000,0000.73510.00069033.7 MBembedding[batch, 768]128The output is compatible with the existing semantic_m1/*.emb.i8 paper-vector files.
Base model
google/bert_uncased_L-4_H-256_A-4