INDEX
Explanations
phrases indicating something is surprising or unexpected
the concept of surprise associated with certain topics or events
New Auto-Interp
Negative Logits
Accessed
-0.81
querque
-0.79
stream
-0.69
aciously
-0.68
interstitial
-0.67
bow
-0.65
buster
-0.64
vance
-0.64
omo
-0.63
estyles
-0.63
POSITIVE LOGITS
me
1.30
outsiders
1.13
anyone
1.04
us
1.01
him
0.98
anybody
0.97
observers
0.93
everyone
0.93
insiders
0.91
many
0.87
Activations Density 0.237%