INDEX
Explanations
prominent characters in storytelling
Negative Logits
fuck      -0.19
fucking   -0.18
fuck      -0.17
fucked    -0.17
Fuck      -0.17
ÑĢай      -0.17
fucks     -0.16
Fuck      -0.16
?>>       -0.16
Seb       -0.16
Positive Logits
racket         0.17
cro            0.17
fian           0.17
Commissioner   0.16
fiance         0.16
deductions     0.16
Rico           0.16
honest         0.16
Bracket        0.16
nit            0.16
Activations Density: 0.055%