INDEX
Explanations
phrases that indicate a comparison or relative positioning
the term "relatively" used in various contexts
New Auto-Interp
Negative Logits
Polo
-0.83
ieu
-0.80
inis
-0.74
tein
-0.73
arta
-0.71
halla
-0.70
Landing
-0.70
iens
-0.70
andel
-0.70
rings
-0.68
POSITIVE LOGITS
tame
0.98
unpop
0.96
unaffected
0.94
unchanged
0.91
insignificant
0.90
innocuous
0.89
scarce
0.89
insensitive
0.87
benign
0.87
inexpensive
0.86
Activations Density 0.014%