INDEX
Explanations
politically related terms, especially those associated with political parties and figures
references to political parties and their candidates or affiliations
New Auto-Interp
Negative Logits
"$:/
-0.82
phasis
-0.80
imaru
-0.75
cellent
-0.74
peak
-0.73
rompt
-0.72
ailable
-0.71
ramid
-0.71
artment
-0.71
catentry
-0.70
POSITIVE LOGITS
filmmaker
1.12
entrepreneur
1.12
strategist
1.12
newcomer
1.11
guru
1.10
comedian
1.09
thinker
1.08
billionaire
1.07
architect
1.07
superstar
1.07
Activations Density 0.351%