Explanations
mentions of the name "Gerry" at varying activations
references to the word "berry" and its variations
Negative Logits
agos        -0.90
isting      -0.76
ahime       -0.75
aepernick   -0.75
iesta       -0.75
agonist     -0.75
icago       -0.75
awar        -0.74
ouch        -0.74
anooga      -0.73
Positive Logits
mand        1.05
mite        0.79
erry        0.78
ments       0.77
Gerry       0.76
mph         0.74
bye         0.73
nda         0.72
llo         0.71
ng          0.70
Activations Density: 0.018%