INDEX
Explanations
phrases related to helping with searches or missing items
New Auto-Interp
Negative Logits
fucks
-0.17
fucking
-0.17
FUCK
-0.17
è©ķ価
-0.16
fuck
-0.16
Fucking
-0.16
Fuck
-0.16
fuck
-0.16
fucked
-0.16
bullshit
-0.15
POSITIVE LOGITS
umpt
0.16
helpers
0.16
jal
0.15
peare
0.15
çģ½
0.15
Friendship
0.14
friendship
0.14
mite
0.14
FAT
0.14
MacDonald
0.14
Activations Density 0.157%