INDEX
Explanations
verbs associated with action or movement
New Auto-Interp
Negative Logits
adolu
-0.17
pers
-0.16
orra
-0.15
kla
-0.15
ipple
-0.14
arrera
-0.14
Spar
-0.14
pas
-0.13
parsers
-0.13
à¸ģารà¹ĥà¸Ĭ
-0.13
POSITIVE LOGITS
atoon
0.16
undy
0.15
illard
0.15
esium
0.14
visiting
0.14
pool
0.14
assa
0.14
vi
0.14
zcze
0.13
Ñĩе
0.13
Activations Density 0.611%