INDEX
Explanations
names related to prominent political figures
proper nouns related to notable individuals, particularly political and historical figures
Negative Logits
atari      -1.04
scape      -0.89
tons       -0.85
tml        -0.85
ansas      -0.82
ris        -0.78
lords      -0.78
manship    -0.77
lan        -0.77
roth       -0.77
Positive Logits
Galile     0.96
Rossi      0.94
Nicola     0.84
Tesla      0.83
zzi        0.81
Simone     0.79
Sturgeon   0.79
Äį         0.75
Nero       0.74
iser       0.74
Activations Density 0.024%