INDEX
Explanations
actions related to bending or lowering one's body
New Auto-Interp
Negative Logits
urum
-0.17
riel
-0.16
SAT
-0.16
etat
-0.15
vala
-0.14
обÑĢеÑĤ
-0.14
icari
-0.14
vla
-0.14
systemd
-0.14
uchi
-0.14
POSITIVE LOGITS
bending
0.16
éĻį
0.15
ault
0.15
bend
0.15
alty
0.15
Authenticate
0.15
bent
0.14
Bow
0.14
Garrett
0.14
cis
0.14
Activations Density 0.072%