INDEX
Explanations
mentions of resignations and removals from positions of authority
New Auto-Interp
Negative Logits
aceae
-0.81
natureconservancy
-0.78
vantage
-0.71
ridges
-0.69
aphael
-0.69
ibaba
-0.67
iba
-0.66
antasy
-0.65
amins
-0.65
00200000
-0.64
POSITIVE LOGITS
sacked
1.02
abruptly
0.92
disgr
0.87
accusing
0.85
whistlebl
0.85
resign
0.84
sergeant
0.82
disgrace
0.82
tenure
0.81
overseeing
0.79
Activations Density 0.120%