Explanations
words related to social and political issues, including oppression, defiance, and activism
Negative Logits
abase      -0.86
wikipedia  -0.76
baum       -0.74
zzo        -0.73
ĸļ         -0.72
atari      -0.71
Base       -0.71
Keys       -0.70
lear       -0.70
eport      -0.70
Positive Logits
bloodshed    1.53
mayhem       1.41
persecution  1.39
injustice    1.37
instability  1.35
violence     1.35
oppression   1.34
repression   1.33
vandalism    1.32
strife       1.32
Activations Density: 0.236%