Explanations
proper nouns related to politics and public figures
prominent political figures mentioned in the document
Negative Logits
olulu        -0.73
berra        -0.65
Flavoring    -0.61
unct         -0.59
apter        -0.58
reen         -0.58
lihood       -0.56
20439        -0.55
mel          -0.55
alys         -0.54
Positive Logits
himself      0.79
supporters   0.74
detractors   0.72
's           0.71
accuser      0.69
TAMADRA      0.68
enegger      0.65
backers      0.65
aides        0.64
risked       0.64
Activations Density 0.320%