Explanations

references to personal relationships and social connections

Biographical text snippets describing people (predominantly women) along with their roles, relationships, or actions.

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Negative Logits

 himself    -0.17
妻    -0.15
his    -0.10
 Wife    -0.10
 wife    -0.10
his    -0.10
 Himself    -0.10
 نفسه    -0.09
 stesso    -0.09
Jr    -0.09

Positive Logits

 herself    0.28
丈夫    0.14
her    0.14
 сама    0.13
 haar    0.12
ová    0.12
 husband    0.11
 сказала    0.11
 její    0.11
 могла    0.10

Activations Density 2.081%
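
To put the density figure in context, here is a minimal sketch of the arithmetic. It assumes that density means the fraction of dashboard tokens on which the feature fires with a nonzero activation. The page does not spell this definition out, so treat it as an assumption. The prompt count and prompt length come from the Configuration section above; the variable and function names are hypothetical.

```python
# Figures taken from the dashboard configuration and density readout.
NUM_PROMPTS = 24_576
TOKENS_PER_PROMPT = 128
ACTIVATION_DENSITY = 0.02081  # 2.081%

# Total tokens the dashboard was computed over.
total_tokens = NUM_PROMPTS * TOKENS_PER_PROMPT

# Approximate number of tokens with a nonzero activation,
# ASSUMING density = active tokens / total tokens.
approx_active_tokens = round(total_tokens * ACTIVATION_DENSITY)


def activation_density(activations):
    """Fraction of entries that are strictly positive (hypothetical helper)."""
    activations = list(activations)
    if not activations:
        return 0.0
    return sum(1 for a in activations if a > 0) / len(activations)


print(total_tokens)
print(approx_active_tokens)
```

Under that assumption, the feature is active on roughly 65 thousand of about 3.1 million tokens, which is consistent with a fairly specific feature rather than a dense one.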