INDEX
Explanations
references to adult-themed content and relationships
Negative Logits
Token        Logit
Salah        -0.16
-fetch       -0.14
pras         -0.14
prostituer   -0.14
bryster      -0.13
mpr          -0.13
athlon       -0.13
طر           -0.13
YST          -0.13
pedia        -0.13
Positive Logits
Token        Logit
mature       0.25
Mature       0.23
Hook         0.23
dating       0.22
matures      0.21
discrete     0.21
hook         0.21
free         0.21
chat         0.20
Hook         0.20
Activations Density: 0.044%
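
For readers unfamiliar with the density figure, the sketch below shows one common way such a number can be computed. It assumes density means the fraction of token positions on which the feature's activation is above zero; the page itself does not state its definition. The function name `activation_density`, the `threshold` parameter, and the sample data are all hypothetical and chosen only for illustration.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: "density" = share of token positions where the feature fires.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 100,000 token positions, 44 of them with a nonzero activation.
sample = [0.0] * 99_956 + [1.5] * 44
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density of 0.044% corresponds to roughly 44 active positions per 100,000 tokens, which would make this a sparse feature.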