Explanations
phrases indicating uncertainty about future outcomes
instances of the phrase "to be."
Negative Logits
hops           -0.65
Advertisement  -0.59
hamm           -0.59
nets           -0.56
simultane      -0.56
jails          -0.54
pointers       -0.54
Boots          -0.53
suspended      -0.53
banned         -0.52
Positive Logits
ggles  0.98
asty   0.94
othy   0.86
pless  0.85
psy    0.83
ads    0.76
fu     0.75
asted  0.74
lling  0.74
adies  0.73
Activations Density: 0.312%