INDEX
Explanations
phrases related to actions and their consequences, particularly in a causal context
"Do" followed by a negative consequence
doing good or harm
New Auto-Interp
Negative Logits
linkovi
-0.75
AssemblyCulture
-0.74
ThroughAttribute
-0.72
estekak
-0.68
useAppContext
-0.68
Tikang
-0.66
сылкі
-0.65
Geplaatst
-0.64
UnusedPrivate
-0.63
كومونز
-0.63
POSITIVE LOGITS
justice
0.92
justice
0.74
Justice
0.70
Justice
0.70
JUSTICE
0.69
service
0.61
harm
0.59
service
0.58
wonders
0.57
justicia
0.56
Activations Density 0.140%