INDEX
Explanations
proper names, specifically focusing on the name "Rudd" and "Ridley."
mentions of the name "Rudd."
New Auto-Interp
Negative Logits
________________________
-0.80
plates
-0.77
________________________________
-0.71
bidden
-0.68
corpus
-0.66
Renaissance
-0.66
gran
-0.64
×ij
-0.63
________________________________________________________________
-0.63
meric
-0.63
POSITIVE LOGITS
Rudd
1.14
ock
0.91
erer
0.88
cliffe
0.86
olph
0.86
uling
0.86
aby
0.85
omon
0.85
enthal
0.85
ieri
0.84
Activations Density 0.006%