Explanations
phrases indicating comparison or contrast
phrases that indicate contrast or opposition
Negative Logits
obyl        -0.77
liam        -0.76
breakers    -0.76
nce         -0.74
ells        -0.73
inho        -0.71
estern      -0.70
negie       -0.67
assies      -0.67
aja         -0.66
Positive Logits
itably           0.83
opposed          0.79
necessarily      0.71
preferring       0.71
allowing         0.67
isons            0.67
materially       0.67
favoring         0.67
chronological    0.66
letting          0.64
Activations Density 0.013%