INDEX
Explanations
themes related to justice and historical accountability, particularly concerning reparations and reconciliation
New Auto-Interp
Negative Logits
egg
-0.18
rael
-0.16
otel
-0.15
Dummy
-0.15
кав
-0.15
èĶ
-0.14
اÙĨÙĪ
-0.14
üz
-0.14
alty
-0.14
Waist
-0.13
POSITIVE LOGITS
Truth
0.30
truth
0.29
victims
0.28
Truth
0.26
victim
0.26
reconciliation
0.26
healing
0.26
Victims
0.25
Victim
0.24
truth
0.24
Activations Density 0.053%