Explanations
expressions of surprise or strong emotions
instances of the phrase "I was like" which indicates reactions or emotions
Negative Logits
atform      -0.77
ourse       -0.75
Published   -0.74
ribution    -0.74
apers       -0.69
glas        -0.68
ulty        -0.68
omsky       -0.66
wordpress   -0.66
iere        -0.66
Positive Logits
lihood      1.04
liest       0.88
lier        0.82
wow         0.79
crazy       0.77
liness      0.67
clock       0.66
crap        0.65
minded      0.65
Crazy       0.63
Activations Density 0.054%