INDEX
Explanations
references to the word "Killer"
references to the term "killer" in various contexts
Negative Logits
Token       Logit
ational     -0.93
ional       -0.89
bles        -0.82
herty       -0.81
heny        -0.77
uration     -0.76
ourced      -0.75
ibly        -0.74
erous       -0.72
ury         -0.71
Positive Logits
Token       Logit
instinct    0.94
killer      0.90
spree       0.84
knife       0.83
whales      0.81
intent      0.78
whale       0.78
mails       0.76
blow        0.74
Killer      0.74
Activations Density: 0.059%