INDEX
Explanations
special characters or non-standard glyphs in the text
New Auto-Interp
Negative Logits
fucked
-0.21
fucking
-0.21
fucks
-0.21
fuck
-0.20
bitch
-0.19
shit
-0.19
fuck
-0.18
prostitutes
-0.18
prostitute
-0.18
shit
-0.18
POSITIVE LOGITS
adorable
0.25
preschool
0.25
kidd
0.24
homeschool
0.23
mommy
0.23
playful
0.23
children
0.23
cute
0.22
fun
0.22
adventures
0.22
Activations Density 0.007%