Add DecoderTCR V0.3 weights (nested layout) + MIT model card

#2
by bl-2633 - opened
DecoderTCR-C-V0.3/300M.ckpt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:18d47c169d0ce992152838b8229e1c682b0a681f55057b471dcdcbf07d2fcad9
3
+ size 3996365253
DecoderTCR-C-V0.3/600M.ckpt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:4d3c84f30e3781c023e412eb3f3c098ef9fe64fd1423fd34df4b40b9dcbac0aa
3
+ size 2300241260
DecoderTCR-C-V0.3/6B.ckpt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b6bd3170b55a9a06d9c1472e248bd083e092afc7924439b444bf7379b74ab692
3
+ size 25408220584
DecoderTCR-ESM2-V0.1/3B_DecoderTCR.ckpt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:fc369f2387dc922660492df86e38cfcff65812201929558cfece4e6e674d3553
3
+ size 11356181292
DecoderTCR-ESM2-V0.1/650M_DecoderTCR.ckpt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:bf88513fabdfec601f2019cb13fb27a2980b370131a488cab3ab1e53bda483a5
3
+ size 2604318926
LICENSE ADDED
@@ -0,0 +1,21 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ MIT License
2
+
3
+ Copyright (c) 2026 Chan Zuckerberg Biohub
4
+
5
+ Permission is hereby granted, free of charge, to any person obtaining a copy
6
+ of this software and associated documentation files (the "Software"), to deal
7
+ in the Software without restriction, including without limitation the rights
8
+ to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
9
+ copies of the Software, and to permit persons to whom the Software is
10
+ furnished to do so, subject to the following conditions:
11
+
12
+ The above copyright notice and this permission notice shall be included in all
13
+ copies or substantial portions of the Software.
14
+
15
+ THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
16
+ IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
17
+ FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
18
+ AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
19
+ LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
20
+ OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
21
+ SOFTWARE.
README.md CHANGED
@@ -2,14 +2,14 @@
2
  license: mit
3
  ---
4
 
5
- # DecoderTCR v0.1
6
- DecoderTCR is a protein language model for T-cell receptor (TCR) & peptide-MHC complexes. The model is based on the ESM2 model family.
7
 
8
  For Model Code and additional information on installation/usage please see [the associated GitHub repository](https://github.com/czbiohub-chi/DecoderTCR)
9
 
10
  ## Model Architecture
11
 
12
- DecoderTCR is built on a Transformer-based protein language model (ESM2 family).
13
 
14
  ### Core Architecture
15
 
@@ -39,12 +39,17 @@ The model follows the **ESM2** architecture, a deep Transformer encoder designed
39
  The model is initialized from a pretrained **ESM2 checkpoint** and further trained via continual pretraining with MLM objectives.
40
 
41
 
42
- ### Model Scale (Example Configurations)
43
 
44
- | Model Variant | Parameters | Layers | Hidden Dim | Attention Heads |
45
- | --- | --- | --- | --- | --- |
46
- | ESM2-650M | ~650M | 33 | 1280 | 20 |
47
- | ESM2-3B | ~3B | 36 | 2560 | 40 |
 
 
 
 
 
48
 
49
  ### Model Card Authors
50
 
@@ -172,7 +177,12 @@ Should you have any security or privacy issues or questions related to the servi
172
  ## Acknowledgements
173
 
174
  This model builds upon:
175
- - **ESM2** by Meta AI (Facebook Research) for the base protein language model
 
176
  - The broader computational biology and immunology research communities
177
 
178
  Special thanks to the developers and contributors of the ESM models and the open-source tools that made this work possible.
 
 
 
 
 
2
  license: mit
3
  ---
4
 
5
+ # DecoderTCR
6
+ DecoderTCR is a protein language model for T-cell receptor (TCR) & peptide-MHC complexes. The models are based on the ESM-2 and ESM-C model families.
7
 
8
  For Model Code and additional information on installation/usage please see [the associated GitHub repository](https://github.com/czbiohub-chi/DecoderTCR)
9
 
10
  ## Model Architecture
11
 
12
+ DecoderTCR is built on a Transformer-based protein language model (ESM-2 and ESM-C families).
13
 
14
  ### Core Architecture
15
 
 
39
  The model is initialized from a pretrained **ESM2 checkpoint** and further trained via continual pretraining with MLM objectives.
40
 
41
 
42
+ ### Released Models
43
 
44
+ This repository hosts two model lines. The **V0.3 ESM-C** line (`DecoderTCR-C`) is the current default; the **V0.1 ESM-2** line is retained for paper reproduction.
45
+
46
+ | Model | File | Backbone | Parameters | Layers | Hidden Dim | Attention Heads |
47
+ | --- | --- | --- | --- | --- | --- | --- |
48
+ | DecoderTCR-C 300M | `DecoderTCR-C-V0.3/300M.ckpt` | ESM-C | ~300M | 30 | 960 | 15 |
49
+ | DecoderTCR-C 600M (default) | `DecoderTCR-C-V0.3/600M.ckpt` | ESM-C | ~600M | 36 | 1152 | 18 |
50
+ | DecoderTCR-C 6B | `DecoderTCR-C-V0.3/6B.ckpt` | ESM-C | ~6B | 80 | 2560 | 40 |
51
+ | DecoderTCR 650M | `DecoderTCR-ESM2-V0.1/650M_DecoderTCR.ckpt` | ESM-2 | ~650M | 33 | 1280 | 20 |
52
+ | DecoderTCR 3B | `DecoderTCR-ESM2-V0.1/3B_DecoderTCR.ckpt` | ESM-2 | ~3B | 36 | 2560 | 40 |
53
 
54
  ### Model Card Authors
55
 
 
177
  ## Acknowledgements
178
 
179
  This model builds upon:
180
+ - **ESM-2** by Meta AI (Facebook Research) for the ESM-2 base protein language model
181
+ - **ESM-C** released by Chan Zuckerberg Biohub (https://github.com/Biohub/esm) for the ESM-C base protein language model
182
  - The broader computational biology and immunology research communities
183
 
184
  Special thanks to the developers and contributors of the ESM models and the open-source tools that made this work possible.
185
+
186
+ ## License
187
+
188
+ DecoderTCR code and all released weights are distributed under the [MIT license](https://github.com/czbiohub-chi/DecoderTCR/blob/main/LICENSE). The base backbones are likewise MIT: ESM-2 (Meta AI) and ESM-C (Chan Zuckerberg Biohub, https://github.com/Biohub/esm).