INDEX
Explanations
the name "Donald Trump" in various contexts
mentions of Donald Trump
New Auto-Interp
Negative Logits
pole
-0.81
dots
-0.77
foss
-0.68
erb
-0.67
duct
-0.65
esville
-0.64
notebooks
-0.64
shapeshifter
-0.63
papers
-0.63
kers
-0.62
POSITIVE LOGITS
Jinping
0.76
terness
0.75
Bid
0.73
Abrams
0.70
thora
0.68
Marshall
0.67
Doyle
0.66
°
0.66
ª
0.65
Griffin
0.64
Activations Density 0.116%