INDEX
Explanations
mentions of autographs or signing items
references to autographs
Negative Logits
Token      Logit
UTION      -0.85
IRO        -0.83
Soda       -0.81
RAFT       -0.80
BE         -0.79
mary       -0.79
ENCE       -0.78
ISION      -0.78
FORE       -0.76
ptives     -0.76
Positive Logits
Token      Logit
ographs    1.26
ograph     1.21
ographed   1.09
aut        1.04
iques      0.99
istically  0.98
ocom       0.93
archs      0.91
umn        0.91
ogyn       0.89
Activations Density 0.008%