INDEX
Explanations
statements with the word "judge" in them
instances of the word "judge" and its variations
New Auto-Interp
Negative Logits
STON
-0.85
spect
-0.81
tera
-0.79
nen
-0.76
cies
-0.72
ccording
-0.71
phabet
-0.70
ultan
-0.69
oresc
-0.69
ivity
-0.69
POSITIVE LOGITS
udge
0.90
hog
0.77
reau
0.74
agate
0.72
gery
0.71
Crusher
0.69
BOX
0.67
gers
0.64
umps
0.64
pees
0.64
Activations Density 0.019%