INDEX
Explanations
punctuations and markers related to questions and exclamations
New Auto-Interp
Negative Logits
\grid
-0.15
yna
-0.15
lems
-0.14
emode
-0.14
ministries
-0.13
avid
-0.13
елик
-0.13
ç«ĭãģ¦
-0.13
anes
-0.13
frog
-0.13
POSITIVE LOGITS
Bek
0.16
Verd
0.14
QUIRE
0.14
Gim
0.14
CHANGE
0.13
pth
0.13
kas
0.13
ãĥ³ãĥģ
0.13
Mathf
0.13
Guerr
0.13
Activations Density 0.066%