INDEX
Explanations
proper names or organizations being in a leading position
phrases indicating leadership or groups being led
New Auto-Interp
Negative Logits
Leilan
-0.65
comfort
-0.62
Showdown
-0.59
issues
-0.57
Crash
-0.56
DISTR
-0.56
irlf
-0.56
srfAttach
-0.54
redundancy
-0.54
McCorm
-0.54
POSITIVE LOGITS
gers
1.05
by
1.01
ges
0.97
by
0.92
gling
0.86
ger
0.84
ging
0.83
bys
0.83
ged
0.82
eric
0.82
Activations Density 0.055%