INDEX
Explanations
terms related to social and political issues
references to societal issues and class distinctions
New Auto-Interp
Negative Logits
ONSORED
-0.83
ĸļ
-0.79
ridor
-0.79
cade
-0.77
ilage
-0.75
ourse
-0.73
Character
-0.72
urrence
-0.72
gression
-0.71
ittal
-0.70
POSITIVE LOGITS
rapists
1.27
robbers
1.23
murderers
1.20
hunters
1.14
thieves
1.14
racists
1.13
revolutionaries
1.13
seekers
1.13
anarchists
1.13
dictators
1.12
Activations Density 0.383%