INDEX
Explanations
mentions of authority figures such as government, professors, or critics
conjunctions particularly the word "and."
New Auto-Interp
Negative Logits
zar
-0.72
cheon
-0.71
aldo
-0.71
FML
-0.68
js
-0.64
jit
-0.63
bumped
-0.63
Redditor
-0.63
fur
-0.62
Saying
-0.62
POSITIVE LOGITS
assorted
0.95
possibly
0.87
ospace
0.77
etc
0.71
others
0.69
downright
0.67
etc
0.65
other
0.65
yes
0.64
consequently
0.64
Activations Density 0.162%