INDEX
Explanations
references to the country Uganda
references to Uganda
New Auto-Interp
Negative Logits
pard
-0.81
alty
-0.80
perature
-0.79
uters
-0.77
âĸ¬
-0.67
hent
-0.66
voy
-0.65
ocular
-0.65
place
-0.65
utenant
-0.65
POSITIVE LOGITS
andan
1.38
Ug
1.14
Uganda
1.02
rica
0.91
Haram
0.89
Erit
0.85
Zamb
0.81
istani
0.80
Rw
0.77
rican
0.77
Activations Density 0.011%