INDEX
Explanations
references to specific groups or communities, such as Roma and Rohingya, in news articles
references to the city of Roma and the Rohingya people
New Auto-Interp
Negative Logits
insula
-0.96
alach
-0.95
orial
-0.87
ointed
-0.85
notation
-0.84
peror
-0.84
nington
-0.80
lished
-0.79
uckland
-0.78
egg
-0.78
POSITIVE LOGITS
Roma
1.06
Tat
0.83
Gy
0.81
Sabha
0.80
Bulgar
0.80
Tos
0.78
Gaz
0.78
Poles
0.77
Gy
0.77
senal
0.76
Activations Density 0.007%