INDEX
Explanations
sentiments of frustration and futility regarding societal issues
New Auto-Interp
Negative Logits
always
-0.69
alſo
-0.66
also
-0.63
always
-0.62
alfo
-0.60
tiež
-0.55
Always
-0.54
already
-0.52
already
-0.52
també
-0.51
POSITIVE LOGITS
ever
1.25
bothered
1.18
bother
1.14
siquiera
1.13
EVER
1.13
bothering
1.06
even
1.02
bothers
0.98
jemals
0.90
even
0.89
Activations Density 0.753%