INDEX
Explanations
expressions of personal experiences and requests for information related to pain management and ancestry
New Auto-Interp
Negative Logits
FUCK
-0.20
fucking
-0.20
fucks
-0.18
Fuck
-0.18
Fucking
-0.17
fucked
-0.16
Fuck
-0.16
fuck
-0.15
shit
-0.15
WTF
-0.15
POSITIVE LOGITS
grands
0.17
Hub
0.16
Hub
0.16
ccione
0.15
Engl
0.15
Hubb
0.15
lue
0.15
Bless
0.14
luv
0.14
ole
0.14
Activations Density 0.362%