Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
gbyuvd
/
FastChemTokenizer
like
0
Feature Extraction
qwen3
chemistry
tokenizer
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
Copy to bucket
new
main
FastChemTokenizer
9.96 MB
Ctrl+K
Ctrl+K
1 contributor
History:
41 commits
gbyuvd
Update FastChemTokenizerHF2.py
8cc4c16
verified
8 months ago
benchmark
Upload latent visualization notebook
8 months ago
bigsmiles-proto
Upload BigSMILES vocab
8 months ago
latent_space_plots
Upload benchmark script and set
8 months ago
selftok_core
Update to include SELFIES Tokenizer & Vocabs
8 months ago
selftok_wtails
Update to include SELFIES Tokenizer & Vocabs
8 months ago
smitok
First commit
8 months ago
smitok_core
Upload HF wrapper and smitok_core without tails
8 months ago
.gitattributes
Safe
1.71 kB
Upload benchmark script and set
8 months ago
CHANGELOG
Safe
193 Bytes
Tensor handling fix
8 months ago
FastChemTokenizer.py
Safe
23.1 kB
Tensor handling fix
8 months ago
FastChemTokenizerHF.py
Safe
24 kB
Proper full HF Compat
8 months ago
FastChemTokenizerHF2.py
25.5 kB
Update FastChemTokenizerHF2.py
8 months ago
README.md
Safe
12.2 kB
Update README.md
8 months ago
config.json
896 Bytes
Update config.json
8 months ago
requirements.txt
Safe
120 Bytes
Upload requirements.txt
8 months ago