Add DecoderTCR V0.3 weights (nested layout) + MIT model card
#2
by bl-2633 - opened
- DecoderTCR-C-V0.3/300M.ckpt +3 -0
- DecoderTCR-C-V0.3/600M.ckpt +3 -0
- DecoderTCR-C-V0.3/6B.ckpt +3 -0
- DecoderTCR-ESM2-V0.1/3B_DecoderTCR.ckpt +3 -0
- DecoderTCR-ESM2-V0.1/650M_DecoderTCR.ckpt +3 -0
- LICENSE +21 -0
- README.md +19 -9
DecoderTCR-C-V0.3/300M.ckpt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:18d47c169d0ce992152838b8229e1c682b0a681f55057b471dcdcbf07d2fcad9
|
| 3 |
+
size 3996365253
|
DecoderTCR-C-V0.3/600M.ckpt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:4d3c84f30e3781c023e412eb3f3c098ef9fe64fd1423fd34df4b40b9dcbac0aa
|
| 3 |
+
size 2300241260
|
DecoderTCR-C-V0.3/6B.ckpt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:b6bd3170b55a9a06d9c1472e248bd083e092afc7924439b444bf7379b74ab692
|
| 3 |
+
size 25408220584
|
DecoderTCR-ESM2-V0.1/3B_DecoderTCR.ckpt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:fc369f2387dc922660492df86e38cfcff65812201929558cfece4e6e674d3553
|
| 3 |
+
size 11356181292
|
DecoderTCR-ESM2-V0.1/650M_DecoderTCR.ckpt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:bf88513fabdfec601f2019cb13fb27a2980b370131a488cab3ab1e53bda483a5
|
| 3 |
+
size 2604318926
|
LICENSE
ADDED
|
@@ -0,0 +1,21 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
MIT License
|
| 2 |
+
|
| 3 |
+
Copyright (c) 2026 Chan Zuckerberg Biohub
|
| 4 |
+
|
| 5 |
+
Permission is hereby granted, free of charge, to any person obtaining a copy
|
| 6 |
+
of this software and associated documentation files (the "Software"), to deal
|
| 7 |
+
in the Software without restriction, including without limitation the rights
|
| 8 |
+
to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
|
| 9 |
+
copies of the Software, and to permit persons to whom the Software is
|
| 10 |
+
furnished to do so, subject to the following conditions:
|
| 11 |
+
|
| 12 |
+
The above copyright notice and this permission notice shall be included in all
|
| 13 |
+
copies or substantial portions of the Software.
|
| 14 |
+
|
| 15 |
+
THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
|
| 16 |
+
IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
|
| 17 |
+
FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
|
| 18 |
+
AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
|
| 19 |
+
LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
|
| 20 |
+
OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
|
| 21 |
+
SOFTWARE.
|
README.md
CHANGED
|
@@ -2,14 +2,14 @@
|
|
| 2 |
license: mit
|
| 3 |
---
|
| 4 |
|
| 5 |
-
# DecoderTCR
|
| 6 |
-
DecoderTCR is a protein language model for T-cell receptor (TCR) & peptide-MHC complexes. The
|
| 7 |
|
| 8 |
For Model Code and additional information on installation/usage please see [the associated GitHub repository](https://github.com/czbiohub-chi/DecoderTCR)
|
| 9 |
|
| 10 |
## Model Architecture
|
| 11 |
|
| 12 |
-
DecoderTCR is built on a Transformer-based protein language model (
|
| 13 |
|
| 14 |
### Core Architecture
|
| 15 |
|
|
@@ -39,12 +39,17 @@ The model follows the **ESM2** architecture, a deep Transformer encoder designed
|
|
| 39 |
The model is initialized from a pretrained **ESM2 checkpoint** and further trained via continual pretraining with MLM objectives.
|
| 40 |
|
| 41 |
|
| 42 |
-
###
|
| 43 |
|
| 44 |
-
|
| 45 |
-
|
| 46 |
-
|
|
| 47 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 48 |
|
| 49 |
### Model Card Authors
|
| 50 |
|
|
@@ -172,7 +177,12 @@ Should you have any security or privacy issues or questions related to the servi
|
|
| 172 |
## Acknowledgements
|
| 173 |
|
| 174 |
This model builds upon:
|
| 175 |
-
- **
|
|
|
|
| 176 |
- The broader computational biology and immunology research communities
|
| 177 |
|
| 178 |
Special thanks to the developers and contributors of the ESM models and the open-source tools that made this work possible.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 2 |
license: mit
|
| 3 |
---
|
| 4 |
|
| 5 |
+
# DecoderTCR
|
| 6 |
+
DecoderTCR is a protein language model for T-cell receptor (TCR) & peptide-MHC complexes. The models are based on the ESM-2 and ESM-C model families.
|
| 7 |
|
| 8 |
For Model Code and additional information on installation/usage please see [the associated GitHub repository](https://github.com/czbiohub-chi/DecoderTCR)
|
| 9 |
|
| 10 |
## Model Architecture
|
| 11 |
|
| 12 |
+
DecoderTCR is built on a Transformer-based protein language model (ESM-2 and ESM-C families).
|
| 13 |
|
| 14 |
### Core Architecture
|
| 15 |
|
|
|
|
| 39 |
The model is initialized from a pretrained **ESM2 checkpoint** and further trained via continual pretraining with MLM objectives.
|
| 40 |
|
| 41 |
|
| 42 |
+
### Released Models
|
| 43 |
|
| 44 |
+
This repository hosts two model lines. The **V0.3 ESM-C** line (`DecoderTCR-C`) is the current default; the **V0.1 ESM-2** line is retained for paper reproduction.
|
| 45 |
+
|
| 46 |
+
| Model | File | Backbone | Parameters | Layers | Hidden Dim | Attention Heads |
|
| 47 |
+
| --- | --- | --- | --- | --- | --- | --- |
|
| 48 |
+
| DecoderTCR-C 300M | `DecoderTCR-C-V0.3/300M.ckpt` | ESM-C | ~300M | 30 | 960 | 15 |
|
| 49 |
+
| DecoderTCR-C 600M (default) | `DecoderTCR-C-V0.3/600M.ckpt` | ESM-C | ~600M | 36 | 1152 | 18 |
|
| 50 |
+
| DecoderTCR-C 6B | `DecoderTCR-C-V0.3/6B.ckpt` | ESM-C | ~6B | 80 | 2560 | 40 |
|
| 51 |
+
| DecoderTCR 650M | `DecoderTCR-ESM2-V0.1/650M_DecoderTCR.ckpt` | ESM-2 | ~650M | 33 | 1280 | 20 |
|
| 52 |
+
| DecoderTCR 3B | `DecoderTCR-ESM2-V0.1/3B_DecoderTCR.ckpt` | ESM-2 | ~3B | 36 | 2560 | 40 |
|
| 53 |
|
| 54 |
### Model Card Authors
|
| 55 |
|
|
|
|
| 177 |
## Acknowledgements
|
| 178 |
|
| 179 |
This model builds upon:
|
| 180 |
+
- **ESM-2** by Meta AI (Facebook Research) for the ESM-2 base protein language model
|
| 181 |
+
- **ESM-C** released by Chan Zuckerberg Biohub (https://github.com/Biohub/esm) for the ESM-C base protein language model
|
| 182 |
- The broader computational biology and immunology research communities
|
| 183 |
|
| 184 |
Special thanks to the developers and contributors of the ESM models and the open-source tools that made this work possible.
|
| 185 |
+
|
| 186 |
+
## License
|
| 187 |
+
|
| 188 |
+
DecoderTCR code and all released weights are distributed under the [MIT license](https://github.com/czbiohub-chi/DecoderTCR/blob/main/LICENSE). The base backbones are likewise MIT: ESM-2 (Meta AI) and ESM-C (Chan Zuckerberg Biohub, https://github.com/Biohub/esm).
|