Explanations
comparisons and measurements between different entities
phrases indicating comparisons or contrasts
Negative Logits
ANCE      -0.76
hardt     -0.73
iband     -0.72
shire     -0.67
Bad       -0.66
Premium   -0.65
*/(       -0.65
arna      -0.64
assian    -0.64
colo      -0.62
Positive Logits
usual         0.93
ours          0.81
previous      0.81
traditional   0.74
those         0.74
other         0.73
actual        0.71
customary     0.69
peers         0.69
comparable    0.68
Activations Density: 0.062%