Explanations
gerunds and actions involving movement or manipulation
Negative Logits
ilon     -0.15
fak      -0.15
erap     -0.14
لاف      -0.14
alion    -0.14
isma     -0.13
çħ       -0.13
voie     -0.13
lesen    -0.13
ncia     -0.13
Positive Logits
around   1.16
around   1.03
Around   1.00
Around   0.96
-around  0.88
autour   0.75
вокруг   0.60
kolem    0.59
حول      0.49
около    0.47
Activations Density 0.185%