INDEX
Explanations
phrases related to seeking or discussing justice
references to justice and related concepts
New Auto-Interp
Negative Logits
acid
-0.81
hetically
-0.78
cair
-0.77
igslist
-0.76
hing
-0.76
ramer
-0.74
urat
-0.73
ergy
-0.73
ÃŁ
-0.69
Dak
-0.69
POSITIVE LOGITS
fulness
0.91
justice
0.89
FUL
0.79
justice
0.79
SYSTEM
0.76
lessness
0.76
injustice
0.71
Gorsuch
0.70
cellence
0.69
ĪĴ
0.68
Activations Density 0.031%