INDEX
Explanations
instances of the word "surely" with high activation values
the word "surely" and its variations in various contexts suggesting certainty or emphasis
Negative Logits
ocene      -0.87
arthed     -0.78
insula     -0.78
psey       -0.77
iatus      -0.75
entary     -0.71
NING       -0.70
anwhile    -0.69
apsed      -0.66
rition     -0.66
Positive Logits
someday    0.83
è¦         0.72
ought      0.67
deserved   0.66
footed     0.66
deserve    0.65
nud        0.64
await      0.63
surpass    0.62
deserves   0.62
Activations Density 0.029%