INDEX
Explanations
strong opinions or beliefs expressed by the author
statements expressing personal opinions or beliefs
New Auto-Interp
Negative Logits
artney
-0.83
ategory
-0.73
clad
-0.72
=~=~
-0.67
announced
-0.66
agna
-0.66
flats
-0.66
eatured
-0.66
ueless
-0.64
inance
-0.63
POSITIVE LOGITS
76561
0.78
saddened
0.71
onymous
0.70
ĸ
0.69
Chimera
0.69
strategically
0.69
asio
0.68
^^^^
0.67
pse
0.65
capitals
0.65
Activations Density 0.057%