flexitok's Collections
baselines

Updated 26 days ago

  • flexitok/llama_baseline_english

    2B • Updated Jan 23

  • flexitok/llama_bpe_dropout_english

    2B • Updated Jan 23

  • flexitok/llama_baseline

    2B • Updated Feb 6

  • flexitok/llama_bpe_dropout

    2B • Updated Feb 6 • 1

  • flexitok/superset_albert

    2B • Updated Feb 6

  • flexitok/xglm_subword_regularization

    2B • Updated Feb 16

  • flexitok/llama_albert

    1B • Updated Feb 17

  • flexitok/llama_43k

    2B • Updated Feb 18 • 1

  • flexitok/superset_albert_w_xglm

    2B • Updated about 1 month ago

  • flexitok/supertokenizer-mod_tokenizers

    54.6M • Updated 26 days ago • 140

  • flexitok/mod-tokenizers-individual

    3.42M • Updated 27 days ago • 37

  • flexitok/mod-tokenizers-ltr_2digit

    3.46M • Updated 27 days ago • 79

  • flexitok/mod-tokenizers-rtl_2digit

    3.46M • Updated 27 days ago • 80

  • flexitok/mod-tokenizers-ltr_3digit

    3.93M • Updated 27 days ago • 84

  • flexitok/mod-tokenizers-ltr_4digit

    8.53M • Updated 27 days ago • 55

  • flexitok/mod-tokenizers-rtl_3digit

    3.93M • Updated 27 days ago • 78

  • flexitok/mod-tokenizers-rtl_4digit

    8.53M • Updated 27 days ago • 71

  • flexitok/mod-tokenizers-ltr_5digit

    54.6M • Updated 26 days ago • 26

  • flexitok/mod-tokenizers-rtl_5digit

    54.6M • Updated 26 days ago • 30