INDEX
Explanations
phrases related to action or movement
phrases related to initiating actions or events
Negative Logits
otten    -0.81
ever     -0.73
ums      -0.72
zzy      -0.72
asca     -0.69
ube      -0.69
uddy     -0.69
anche    -0.69
panel    -0.69
gor      -0.68
Positive Logits
havoc         0.75
EMENT         0.65
ILCS          0.61
FI            0.60
defences      0.60
linem         0.59
attRot        0.59
whereby       0.59
HER           0.58
detachment    0.58
Activations Density 0.062%