INDEX
Explanations
mentions of specific political figures like President Obama
mentions of political figures, particularly Barack Obama and Donald Trump
New Auto-Interp
Negative Logits
MJ
-0.77
Offline
-0.72
NP
-0.66
mats
-0.65
ãĤ¯
-0.64
notebooks
-0.64
dates
-0.64
Tokens
-0.64
probabilities
-0.61
coins
-0.61
POSITIVE LOGITS
ÃŃs
0.88
vetoed
0.80
Presents
0.78
Jinping
0.76
assembled
0.75
pard
0.72
iets
0.69
hower
0.68
enstein
0.68
Appears
0.68
Activations Density 0.062%