INDEX
Explanations
phrases that indicate surprise or unexpectedness
phrases indicating a sense of surprise or unexpectedness
New Auto-Interp
Negative Logits
urance
-0.74
interstitial
-0.66
riage
-0.65
aneously
-0.65
ergy
-0.65
heit
-0.65
Accessed
-0.65
aneous
-0.64
athering
-0.64
vance
-0.63
POSITIVE LOGITS
me
1.38
outsiders
1.26
anyone
1.14
us
1.10
observers
1.08
anybody
1.08
many
1.07
everyone
0.98
most
0.98
everybody
0.94
Activations Density 0.162%