INDEX
Explanations
proper nouns, specifically names of individuals
references to specific individuals, particularly related to political contexts
New Auto-Interp
Negative Logits
enegger
-0.90
oster
-0.76
alore
-0.73
ilde
-0.73
redit
-0.72
aber
-0.71
anooga
-0.71
inances
-0.70
fleet
-0.70
rarily
-0.69
POSITIVE LOGITS
Sonia
0.93
Gandhi
0.87
Abedin
0.74
cles
0.73
OTUS
0.72
Tome
0.72
Nad
0.69
éĥ
0.68
Rural
0.65
ata
0.64
Activations Density 0.013%