INDEX
Explanations
proper names, specifically the name "Steven"
the name "Steven" across various contexts
New Auto-Interp
Negative Logits
LOAD
-0.77
teen
-0.72
cffffcc
-0.65
supreme
-0.61
eer
-0.61
fect
-0.60
req
-0.60
circ
-0.59
hops
-0.58
agonist
-0.58
POSITIVE LOGITS
Spielberg
1.23
sonian
1.20
Gerrard
1.07
Moff
1.00
herty
0.99
Avery
0.98
Mn
0.94
Hawking
0.92
Universe
0.81
Stam
0.79
Activations Density 0.015%