INDEX
Explanations
hexadecimal values in code
New Auto-Interp
Negative Logits
hog
-0.16
aur
-0.15
NCY
-0.14
ernes
-0.14
ÑĤаб
-0.14
гл
-0.14
Rough
-0.14
Moy
-0.14
pha
-0.14
Ãĺ
-0.14
POSITIVE LOGITS
_managed
0.15
issors
0.14
arket
0.14
617
0.14
æ¯
0.14
elli
0.14
ấ
0.14
zel
0.13
616
0.13
Increment
0.13
Activations Density 0.005%