INDEX
Explanations
words related to judgment or judicial matters
words related to judgment
New Auto-Interp
Negative Logits
cffffcc
-0.75
tera
-0.74
worth
-0.73
replace
-0.73
WD
-0.69
riott
-0.68
abee
-0.67
ttes
-0.67
EDIT
-0.65
Alz
-0.65
POSITIVE LOGITS
Jud
1.23
jud
1.19
gement
1.14
gements
1.00
satell
0.99
gments
0.96
itsu
0.96
unden
0.95
corrid
0.90
artif
0.88
Activations Density 0.008%