INDEX
Explanations
judgment-related words or phrases
occurrences of the term "judgment" and its variations
New Auto-Interp
Negative Logits
href
-0.82
tails
-0.69
heres
-0.68
ieri
-0.68
repeat
-0.66
tail
-0.64
chell
-0.64
vae
-0.63
everal
-0.62
mers
-0.62
POSITIVE LOGITS
judgment
1.30
judgement
1.23
judgments
1.07
Judgment
1.07
eering
0.91
naire
0.83
ACTIONS
0.79
jud
0.77
jud
0.74
debtor
0.71
Activations Density 0.015%