INDEX
Explanations
phrases indicating additional details or information
phrases that include the expression "not to mention."
New Auto-Interp
Negative Logits
hard
-0.77
itton
-0.70
arij
-0.70
twitch
-0.69
psons
-0.66
sis
-0.66
arat
-0.65
heres
-0.65
forums
-0.63
irm
-0.63
POSITIVE LOGITS
mentioning
0.84
lihood
0.80
minus
0.78
nor
0.76
aloud
0.72
mention
0.70
_>
0.68
anymore
0.68
suffice
0.66
whatsoever
0.66
Activations Density 0.021%