INDEX
Explanations
information provided by spokespeople in news articles
references to spokespersons or representatives
New Auto-Interp
Negative Logits
rament
-0.72
spons
-0.68
SPONSORED
-0.65
acca
-0.65
bearded
-0.64
omer
-0.64
pan
-0.63
aughs
-0.63
don
-0.62
tein
-0.62
POSITIVE LOGITS
Anne
1.17
Marie
1.10
Louise
1.04
Nicole
1.00
Isabel
0.94
Anne
0.94
Christina
0.93
Elizabeth
0.92
Elaine
0.90
Mary
0.89
Activations Density 0.044%