INDEX
Explanations
phrases related to political figures and their actions
possessive forms related to President Obama and President Trump
New Auto-Interp
Negative Logits
anges
-0.82
olkien
-0.81
ragon
-0.81
lehem
-0.79
hari
-0.78
Reviewer
-0.77
cam
-0.73
Developer
-0.73
jun
-0.73
Kubrick
-0.73
POSITIVE LOGITS
own
1.22
newest
1.11
penchant
1.04
insistence
1.00
signature
1.00
latest
0.99
foray
0.97
playbook
0.94
antics
0.93
predicament
0.93
Activations Density 0.162%