INDEX
Explanations
references to significant numbers or numerical values
New Auto-Interp
Negative Logits
illac
-0.14
Caps
-0.14
ieg
-0.13
Carm
-0.13
Cone
-0.13
Lah
-0.13
Blo
-0.13
мил
-0.13
olas
-0.13
uis
-0.13
POSITIVE LOGITS
oz
0.21
elder
0.19
Jury
0.18
Mong
0.18
jury
0.18
yok
0.18
stable
0.18
éĥ¨å±ĭ
0.17
stable
0.17
NSK
0.17
Activations Density 0.006%