INDEX

Explanations

references to gendered individuals and their roles in narratives

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

agina

-0.08

lech

-0.07

 themselves

-0.07

illow

-0.06

 itself

-0.06

 personalities

-0.06

acher

-0.06

mae

-0.06

ÑģÑĤÐ¸ÑĤ

-0.06

.Static

-0.05

POSITIVE LOGITS

 whose

0.09

who

0.09

whose

0.09

Ø¹Ø§ÙĨ

0.08

brains

0.07

umpt

0.07

 whom

0.07

hattan

0.07

åĽ£

0.07

UnderTest

0.06

Activations Density 0.026%