INDEX
Explanations
phrases related to making judgments or assessments of situations or individuals
instances of the word "judge" and its variants in various contexts
New Auto-Interp
Negative Logits
adra
-0.86
ulic
-0.80
lund
-0.79
ols
-0.73
jc
-0.71
heny
-0.71
hall
-0.70
jet
-0.70
uctions
-0.70
Bio
-0.69
POSITIVE LOGITS
harshly
1.00
judging
0.90
whether
0.86
judgement
0.85
fairness
0.83
worthiness
0.82
judgment
0.81
objectively
0.78
qualifications
0.73
criteria
0.72
Activations Density 0.045%