Explanations
references to political entities and geographical locations, particularly related to Bangladesh
Negative Logits
aarrggbb    -0.64
marvin      -0.56
Erm         -0.52
Hers        -0.50
ymce        -0.50
retire      -0.49
webdriver   -0.49
reti        -0.48
Carlsbad    -0.47
Erm         -0.47
Positive Logits
Bangladesh   1.24
Banglades    1.12
Bangladesh   1.11
Dhaka        1.05
Bengali      1.02
ladesh       0.98
AndEndTag    0.94
Efq          0.88
Kolkata      0.85
Chowdhury    0.85
Activations Density: 0.032%