INDEX
Explanations
specific references to the term "Homer" with varying levels of activation
instances of the name "Homer" and its variations
New Auto-Interp
Negative Logits
pots
-0.72
birth
-0.65
PER
-0.65
²¾
-0.65
Ĥª
-0.63
Assassins
-0.62
comings
-0.61
orld
-0.60
Ws
-0.60
CLOSE
-0.60
POSITIVE LOGITS
astics
0.84
cial
0.82
ic
0.81
arch
0.80
anz
0.80
archy
0.78
ufact
0.76
ocl
0.76
ophone
0.76
andom
0.76
Activations Density 0.030%