INDEX
Explanations
comparisons in sentences
phrases that involve comparisons or references to groups or individuals
New Auto-Interp
Negative Logits
istries
-0.88
arios
-0.71
ONEY
-0.70
akeru
-0.61
liga
-0.61
thanking
-0.59
*/(
-0.59
ERSON
-0.59
ategy
-0.57
ENSE
-0.56
POSITIVE LOGITS
slightest
0.78
usual
0.77
predecessors
0.73
rivals
0.70
ordinary
0.69
usual
0.69
actual
0.68
verages
0.67
counterparts
0.67
competitors
0.64
Activations Density 0.214%