INDEX

Explanations

sexual activity and relations

references to sexual activity, particularly unprotected sex and risky sexual behaviors.

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

Negative Logits

𝐟

-0.77

 sitter

-0.77

ethene

-0.76

chern

-0.75

dora

-0.75

Blocking

-0.73

apunov

-0.73

 баллов

-0.71

 Percival

-0.71

癮

-0.70

POSITIVE LOGITS

 intercourse

2.55

sex

2.55

 sexual

2.13

Sex

1.73

Sex

1.72

sex

1.66

 relaciones

1.56

SEX

1.55

 sexo

1.54

 relations

1.46

Activations Density 0.050%