INDEX
Explanations
sentences presenting strong opinions or emphasizing a point
expressions of opinion or assertion
New Auto-Interp
Negative Logits
nton
-0.59
obbies
-0.59
culosis
-0.57
elight
-0.56
oqu
-0.56
redited
-0.56
Pict
-0.55
sequently
-0.55
egu
-0.55
ospital
-0.54
POSITIVE LOGITS
fucking
0.71
goddamn
0.63
bullshit
0.62
fuck
0.61
fucked
0.60
Canaver
0.60
Voldemort
0.60
_>
0.60
fuck
0.59
physicists
0.59
Activations Density 1.721%