Explanations
references to the rhythm or speed of actions
Negative Logits
Token     Logit
aph       -0.16
adt       -0.15
alem      -0.15
ename     -0.15
zcze      -0.14
Ú         -0.14
ewith     -0.14
Alive     -0.14
Ñĥб       -0.14
iae       -0.14
Positive Logits
Token     Logit
rim       0.20
pace      0.18
inos      0.17
Pace      0.17
adil      0.16
itution   0.15
-paced    0.15
Ùĩر       0.15
prav      0.15
YTE       0.14
Activations Density: 0.010%