INDEX
Explanations
mentions of the concept of justice
references to the concept of justice
Negative Logits
Dak          -0.83
hetically    -0.78
urat         -0.76
ULAR         -0.75
cair         -0.72
ÃŁ           -0.72
iple         -0.72
igslist      -0.72
acid         -0.71
pole         -0.71
Positive Logits
justice      1.06
justice      0.94
fulness      0.89
Justice      0.79
FUL          0.76
injustice    0.74
Assistance   0.70
ģĸ           0.70
laureate     0.69
äºĶ          0.69
Activations Density 0.023%