INDEX
Explanations
the name "Cole" at varying levels of activation
references to the name "Cole."
New Auto-Interp
Negative Logits
oppable
-0.68
ĺħ
-0.65
channelAvailability
-0.64
ocre
-0.64
guiActiveUn
-0.62
hots
-0.61
oslav
-0.61
nomine
-0.60
ģ
-0.59
>>\
-0.58
POSITIVE LOGITS
tto
1.09
brate
0.86
opter
0.84
tti
0.84
Porter
0.80
mons
0.79
opsis
0.78
phrine
0.75
bourne
0.75
lette
0.74
Activations Density 0.033%