INDEX
Explanations
past tense verbs
statements related to accountability and consequences
New Auto-Interp
Negative Logits
ELS
-0.57
thodox
-0.55
Il
-0.52
los
-0.52
Bridge
-0.49
airs
-0.49
Eastern
-0.48
updated
-0.48
Balt
-0.48
Ń
-0.48
POSITIVE LOGITS
yourself
1.30
yourselves
1.19
Yourself
0.89
your
0.76
YOUR
0.70
poke
0.67
your
0.63
Your
0.61
panties
0.60
smack
0.59
Activations Density 0.872%