INDEX
Explanations
commands or instructions related to transportation or movement
New Auto-Interp
Negative Logits
rem
-0.15
chan
-0.15
ille
-0.15
Hosp
-0.15
lla
-0.14
rap
-0.14
rap
-0.14
lick
-0.14
Jah
-0.14
adients
-0.14
POSITIVE LOGITS
istrovstvÃŃ
0.18
ovÃŃ
0.16
ecta
0.16
umat
0.15
ettings
0.14
Ø·ÙĨ
0.14
446
0.14
омеÑĤ
0.14
inge
0.14
κά
0.14
Activations Density 0.117%