INDEX
Explanations
generic phrases indicating comparison or lack of significance
the phrase "not much," indicating a lack of substantial content or variety
New Auto-Interp
Negative Logits
yne
-0.81
izoph
-0.77
kus
-0.74
ilts
-0.71
ĪĴ
-0.70
YES
-0.70
otton
-0.68
initely
-0.68
han
-0.67
Ĥİ
-0.66
POSITIVE LOGITS
anymore
1.22
whatsoever
0.94
consolation
0.88
else
0.87
nor
0.84
bothered
0.84
bother
0.79
noticeable
0.78
avail
0.78
surprises
0.76
Activations Density 0.089%