INDEX
Explanations
the name "Vincent" at various levels of activation
mentions of the name "Vincent" or variations thereof
Negative Logits
swick     -0.80
enegger   -0.78
acement   -0.74
dress     -0.74
izoph     -0.72
wart      -0.71
abies     -0.70
ansas     -0.69
gow       -0.69
nob       -0.69
Positive Logits
Scully    0.91
Staples   0.90
rette     0.84
Foster    0.84
Lomb      0.82
Bucc      0.81
inho      0.80
Vaughn    0.77
oso       0.77
Kart      0.75
Activations Density 0.073%
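
As a rough illustration of what a density figure like this usually means, the sketch below computes the fraction of token positions on which a feature fires. It assumes density is defined as "activation above zero, divided by total tokens", which is a common convention but is not stated on this page. The function name `activation_density`, the `threshold` parameter, and the toy data are all hypothetical and are not taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions where the feature fires.

    Assumption: a position counts as "firing" when its activation is
    strictly greater than `threshold` (0.0 by default).
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    fired = sum(1 for a in activations if a > threshold)
    return fired / len(activations)


# Toy data (made up for illustration): 100,000 token positions,
# 73 of which have a non-zero activation.
toy_activations = [0.0] * 99_927 + [1.5] * 73

density = activation_density(toy_activations)
formatted = f"{density:.3%}"
print(formatted)
```

Under that assumption, a density of 0.073% would correspond to the feature firing on roughly 73 out of every 100,000 tokens, which fits a feature tied to a single name.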