INDEX
Explanations
mentions of the word "Most" followed by a statement
the word "Most" in various contexts
Negative Logits
rompt      -0.73
vest       -0.73
icer       -0.71
pload      -0.69
SECTION    -0.66
Mellon     -0.63
pton       -0.62
multipl    -0.62
thur       -0.61
abad       -0.61
Positive Logits
importantly  1.16
notably      0.85
afa          0.84
body         0.82
likely       0.81
important    0.79
tenance      0.78
mornings     0.76
egreg        0.76
entimes      0.75
Activations Density: 0.056%