INDEX
Explanations
proper names ("Billy" in this case)
mentions of the name "Billy."
New Auto-Interp
Negative Logits
ually
-0.87
ional
-0.82
aries
-0.78
arily
-0.77
yrinth
-0.74
ATING
-0.74
eled
-0.74
ially
-0.72
ational
-0.71
ators
-0.70
POSITIVE LOGITS
Goat
0.90
Graham
0.90
cock
0.86
xtap
0.85
Joel
0.83
Dee
0.78
Gunn
0.78
Hallow
0.78
Slater
0.78
Nelson
0.77
Activations Density 0.035%