INDEX
Explanations
words related to someone being shamed, humiliated, or having a negative reputation
words related to disgrace and job loss
New Auto-Interp
Negative Logits
aminer
-0.79
itionally
-0.72
fw
-0.72
bra
-0.71
akeru
-0.71
vor
-0.71
ingham
-0.70
argo
-0.69
utra
-0.69
prints
-0.69
POSITIVE LOGITS
embattled
1.19
sacked
1.12
disgr
1.06
ousted
0.97
embroiled
0.88
estranged
0.88
former
0.87
thous
0.84
Former
0.82
Former
0.78
Activations Density 0.016%