Explanations
verbs related to power or control
references to power and control
Negative Logits
akeru     -0.77
ocol      -0.62
antz      -0.59
lapse     -0.59
nih       -0.59
chrom     -0.57
fail      -0.57
Freddie   -0.55
otide     -0.55
Saving    -0.55
Positive Logits
wielded    1.17
wield      0.98
wielding   0.92
¯          0.88
riors      0.85
levers     0.83
hold       0.83
chairs     0.79
halla      0.79
tip        0.76
Activations Density: 0.024%