INDEX
Explanations
the name "Ben Affleck"
references to the actor Ben Affleck
Negative Logits
Token      Logit
North      -0.69
exp        -0.67
spirits    -0.66
tab        -0.66
IC         -0.65
virtual    -0.63
Native     -0.63
Odyssey    -0.63
mult       -0.63
PRO        -0.63
Positive Logits
Token      Logit
leck       5.39
lest       1.08
hoff       1.06
loe        1.01
wagen      0.99
lus        0.97
tsky       0.96
lett       0.94
cki        0.94
anski      0.93
Activations Density 0.021%