INDEX
Explanations
punctuation marks and various conjunctions within the text
New Auto-Interp
Negative Logits
ouve
-0.17
Worker
-0.15
عÙĪØ¯
-0.15
_nh
-0.14
ugo
-0.14
tod
-0.14
Alv
-0.14
Ñģий
-0.14
ideo
-0.14
çŃ
-0.13
POSITIVE LOGITS
/Dk
0.15
_ur
0.15
.ss
0.15
bil
0.14
vertime
0.14
bên
0.14
vel
0.14
/DD
0.14
uml
0.13
orian
0.13
Activations Density 0.027%