INDEX
Explanations
proper nouns or names of people and places
references to specific political figures or leaders
Negative Logits
Token        Value
annah        -0.74
EEE          -0.72
acc          -0.68
ggy          -0.67
lynn         -0.66
auna         -0.65
chromos      -0.64
pects        -0.64
ankind       -0.64
wikipedia    -0.62
Positive Logits
Token        Value
Maduro       3.50
Nicarag      1.45
cigars       1.08
Mad          1.05
Nicaragua    1.04
Venezuel     1.03
Rust         1.03
henko        1.01
cigar        0.99
Matte        0.97
Activations Density: 0.033%