INDEX
Explanations
mentions of romantic or intimate actions like kissing
references to kissing and related actions
New Auto-Interp
Negative Logits
é¾
-1.01
ifted
-0.74
quickShipAvailable
-0.73
ulhu
-0.70
izoph
-0.70
æ©Ł
-0.67
IJ
-0.67
Administ
-0.67
ENCY
-0.67
DISTR
-0.66
POSITIVE LOGITS
goodbye
1.28
kiss
1.14
Kiss
1.07
kisses
1.06
kissing
0.99
kiss
0.99
creen
0.98
kissed
0.95
passionately
0.91
Goodbye
0.83
Activations Density 0.026%