Explanations

references to homosexuality and its moral implications

Configuration

Prompts (Dashboard): 24,576 prompts, 128 tokens each

Dataset (Dashboard): cerebras/SlimPajama-627B

Negative Logits

Token               Logit
"iverz"             -0.08
"ofilm"             -0.07
"StateException"    -0.07
"endid"             -0.07
"outu"              -0.07
" Porn"             -0.07
"abyrinth"          -0.07
"%+"                -0.07
"uiltin"            -0.07
"olland"            -0.07

Positive Logits

Token               Logit
" pair"             0.07
" anal"             0.07
"sod"               0.07
"unn"               0.07
" consenting"       0.07
"/preferences"      0.07
" Orient"           0.06
" consent"          0.06
"bis"               0.06
" practiced"        0.06

Activations Density 0.010%
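
As a rough sanity check on the figures above, the dashboard's total token count and the approximate number of tokens on which this feature fires can be worked out directly. This is only a sketch: it assumes the activation density is the fraction of tokens with a nonzero activation and that it was measured over the same 24,576 dashboard prompts, neither of which the page states. All variable names are illustrative.

```python
# Figures taken from the dashboard configuration.
num_prompts = 24_576
tokens_per_prompt = 128
density_percent = 0.010  # "Activations Density" as displayed

# Total tokens across all dashboard prompts.
total_tokens = num_prompts * tokens_per_prompt

# ASSUMPTION: density = share of tokens with a nonzero activation,
# measured over the dashboard prompts themselves.
expected_active_tokens = total_tokens * density_percent / 100

print(f"total tokens: {total_tokens:,}")
print(f"approx. activating tokens: {expected_active_tokens:.0f}")
```

Under those assumptions the feature would be active on only a few hundred of the roughly three million tokens, which is in line with a sparse, narrowly scoped feature.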