INDEX

Explanations

statements reflecting traditional views on gender roles and societal expectations

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

egin

-0.08

gio

-0.07

heel

-0.07

 aime

-0.07

ëŁ

-0.07

chwitz

-0.07

ikk

-0.07

elo

-0.07

ÑĢÐµÐ¼

-0.07

unate

-0.07

POSITIVE LOGITS

 convinced

0.06

 premises

0.06

 view

0.06

 Incoming

0.06

 notions

0.06

("

0.05

ÛĮØ±ÛĮ

0.05

 consequently

0.05

 belief

0.05

_this

0.05

Activations Density 0.031%