Johnblick187 commited on
Commit
cc30de2
·
verified ·
1 Parent(s): 4e7c0cc

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +3 -1
README.md CHANGED
@@ -67,4 +67,6 @@ SmartCoderMoE’s 2048 hidden size was chosen to natively align with:
67
  Coding. Lots of it. Uncensored.
68
 
69
  ## Note from the Creator
70
- As of the writing of this model card (Thursday, May 21st, 2026), the model is not finished. Multimodal expansion, as mentioned before, is on the way. As is a very unique calculation of how much of the original Starcoder knowledge remains. i will update the repo as i go. Feel free to use it while i build on it if you desire, and if you decide to do this and encounter any sort of issues woth it, please let me know so that i can fix it asap!
 
 
 
67
  Coding. Lots of it. Uncensored.
68
 
69
  ## Note from the Creator
70
+ As of the writing of this model card (Thursday, May 21st, 2026), the model is not finished. Multimodal expansion, as mentioned before, is on the way. As is a very unique calculation of how much of the original Starcoder knowledge remains. i will update the repo as i go. Feel free to use it while i build on it if you desire, and if you decide to do this and encounter any sort of issues woth it, please let me know so that i can fix it asap!
71
+ ## UPDATE!!!!!
72
+ several bugs detected in the model. due to a saving error, the model's weights were saved with incorrect key mapping. 1 bug causes from pretrained to fail to remap unless it is overridden at a higher level. due to this, everysingle weight is incorrectly saved. this with the fact that he hasnt been trained since conception means he NaNs during inference. Requires full fine tuning to use