INDEX
Explanations
names of people or organizations related to politics or medicine
prominent names and specific terms associated with influential entities or topics
New Auto-Interp
Negative Logits
teen
-0.79
ELL
-0.65
lly
-0.65
Bridge
-0.65
selected
-0.63
Cand
-0.62
©¶æ
-0.62
low
-0.61
pend
-0.61
explan
-0.60
POSITIVE LOGITS
azine
0.85
itude
0.84
Amph
0.83
atform
0.82
ific
0.82
iant
0.75
anca
0.75
ician
0.74
adiator
0.74
indal
0.73
Activations Density 0.025%