INDEX
Explanations
mentions of the word "Gun" in the text
New Auto-Interp
Negative Logits
Strawberry
-0.75
ç«
-0.71
Archdemon
-0.69
Hort
-0.67
Dragons
-0.67
Virtue
-0.64
coli
-0.63
Humph
-0.60
Advis
-0.59
uthor
-0.59
POSITIVE LOGITS
powder
1.74
nery
1.32
smith
1.23
ner
1.20
ners
1.16
nar
1.14
ning
1.12
fight
1.11
boats
1.11
fighters
1.11
Activations Density 0.032%