INDEX
Explanations
references to colonialism and its historical context
New Auto-Interp
Negative Logits
Iranian
-0.14
Kurdish
-0.14
Jordan
-0.14
aval
-0.14
Belarus
-0.14
ARI
-0.14
intr
-0.14
intr
-0.14
Media
-0.14
red
-0.14
POSITIVE LOGITS
colonial
0.35
æ®ĸ
0.34
colon
0.31
colonies
0.31
icolon
0.28
Colonial
0.28
æ¤į
0.28
colony
0.27
Colon
0.26
colon
0.25
Activations Density 0.118%