INDEX
Explanations
references to colonization and its impact on cultures and societies
Negative Logits
queer        -0.15
osti         -0.15
UU           -0.15
ADB          -0.14
progressive  -0.14
homophobic   -0.14
leston       -0.14
fucks        -0.14
èµ·          -0.14
phinx        -0.14
Positive Logits
liberty      0.23
Liberty      0.21
itbart       0.18
Freedom      0.17
freedom      0.17
alink        0.16
Heritage     0.16
Classical    0.16
WND          0.16
ä¼Ŀ          0.15
Activations Density: 0.771%