INDEX
Explanations
words related to negative impacts or flaws
words and phrases related to damage or blemishes on reputation or appearance
New Auto-Interp
Negative Logits
agy
-0.78
WER
-0.76
yip
-0.76
raltar
-0.75
cffffcc
-0.73
trak
-0.72
ultane
-0.72
ultan
-0.71
guiActiveUn
-0.71
ãĤµãĥ¼ãĥĨãĤ£ãĥ¯ãĥ³
-0.70
POSITIVE LOGITS
tarn
1.02
stains
0.91
stain
0.87
luster
0.82
stained
0.82
coating
0.82
smear
0.81
uous
0.80
linen
0.77
washed
0.77
Activations Density 0.167%