INDEX
Explanations
names of a specific individual, "Jake"
the name "Jake" across various contexts
New Auto-Interp
Negative Logits
conduc
-0.78
iary
-0.75
iated
-0.73
amera
-0.65
acent
-0.64
subst
-0.63
arily
-0.62
oppable
-0.62
Ħ¢
-0.62
Ü
-0.62
POSITIVE LOGITS
glers
1.00
Gy
0.89
ansas
0.88
Jake
0.83
unin
0.83
Skywalker
0.81
McGee
0.77
EStream
0.76
cki
0.76
caster
0.75
Activations Density 0.021%