Explanations
words related to social issues and criticism
Negative Logits
Token         Logit
RAW           -0.82
STDOUT        -0.66
tab           -0.64
apers         -0.64
kindred       -0.63
versions      -0.62
leted         -0.61
IAL           -0.61
inese         -0.60
actionDate    -0.59
Positive Logits
Token         Logit
terday        1.61
hhhh          1.00
hhh           0.90
hh            0.85
sir           0.83
pardon        0.79
yes           0.78
Yeah          0.76
yeah          0.73
soever        0.72
Activations Density 1.402%
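
For readers unfamiliar with the density figure, the sketch below shows one common way such a number is obtained: the fraction of token positions at which the feature's activation exceeds a threshold. This page does not state how its own value was computed, so the function name `activation_density`, the zero threshold, and the toy data are all illustrative assumptions.

```python
from typing import Sequence


def activation_density(activations: Sequence[float], threshold: float = 0.0) -> float:
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: "density" means the share of sampled tokens on which the
    feature fires. The page itself does not define the term.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical toy data: one activation value per token position.
sample = [0.0, 0.0, 1.3, 0.0, 0.0, 0.0, 0.2, 0.0, 0.0, 0.0]
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density of 1.402% would mean the feature is active on roughly 14 of every 1,000 sampled tokens.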