INDEX
Explanations
topics related to justice and reconciliation
New Auto-Interp
Negative Logits
egg
-0.17
üz
-0.15
oyer
-0.15
кав
-0.15
oupon
-0.14
roke
-0.14
?=.*
-0.13
ãĤĵãģ¨
-0.13
pán
-0.13
lemetry
-0.13
POSITIVE LOGITS
Truth
0.28
truth
0.28
Truth
0.26
repar
0.26
victims
0.26
truth
0.25
healing
0.24
reconciliation
0.24
apologies
0.23
restitution
0.23
Activations Density 0.076%