INDEX
Explanations
mentions of a specific name or word, "Homer."
occurrences of the word "Homer" and its variations
Negative Logits
Token       Logit
urat        -0.66
Assassins   -0.66
PER         -0.65
Ĥª          -0.65
orld        -0.63
Ws          -0.62
ressing     -0.62
assault     -0.61
resses      -0.60
pots        -0.60
Positive Logits
Token       Logit
ophone      0.83
cial        0.81
omer        0.80
ically      0.80
oths        0.78
gdala       0.77
ic          0.77
oth         0.77
icably      0.75
olog        0.74
Activations Density: 0.018%