INDEX
Explanations
comparative terms indicating judgment or evaluation, such as "better" or "worse"
comparative adjectives that express improvement or decline
New Auto-Interp
Negative Logits
esville
-0.74
ettes
-0.72
Gazette
-0.70
MG
-0.65
gemony
-0.65
SEA
-0.64
NH
-0.64
essee
-0.64
Nob
-0.63
trl
-0.63
POSITIVE LOGITS
than
1.06
Than
1.01
than
0.90
behaved
0.88
suited
0.82
eleph
0.79
acquainted
0.68
vers
0.68
iation
0.68
bang
0.68
Activations Density 0.049%