INDEX
Explanations
mentions of former individuals
references to individuals with a prior status or position
New Auto-Interp
Negative Logits
utra
-0.68
fw
-0.68
angles
-0.67
zai
-0.64
aph
-0.63
acs
-0.61
Fram
-0.61
TT
-0.61
grim
-0.61
ggle
-0.61
POSITIVE LOGITS
former
3.32
Former
2.29
Former
2.25
former
2.03
longtime
1.71
formerly
1.62
retired
1.59
latter
1.56
veteran
1.42
disgr
1.36
Activations Density 0.039%