Explanations

references to positive relationships and connections among people

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Negative Logits

 herself

-0.18

atrice

-0.13

woman

-0.13

彼女

-0.12

 Woman

-0.12

小姐

-0.12

 Actress

-0.12

daughter

-0.12

Woman

-0.12

 Frau

-0.12

Positive Logits

guy

0.14

 boys

0.12

men

0.12

boy

0.12

 gentlemen

0.12

 male

0.12

 guys

0.12

 himself

0.12

 fathers

0.12

 father

0.12

Activations Density 0.684%
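
To give the density figure a sense of scale, the sketch below converts it into an approximate count of token positions. It assumes that the activation density is the fraction of token positions in the dashboard prompts on which the feature has a nonzero activation; the page does not state this definition, so treat the result as a rough estimate. The function names are illustrative and not part of any dashboard API.

```python
# Figures copied from the dashboard fields above.
NUM_PROMPTS = 24_576
TOKENS_PER_PROMPT = 128
ACTIVATION_DENSITY = 0.684 / 100  # 0.684% expressed as a fraction


def total_token_positions(num_prompts: int, tokens_per_prompt: int) -> int:
    """Total number of token positions in the dashboard prompt set."""
    return num_prompts * tokens_per_prompt


def estimated_active_positions(total: int, density: float) -> int:
    """Approximate count of positions where the feature fires.

    Assumption: density is measured per token position over the same
    prompts listed in the configuration.
    """
    return round(total * density)


total = total_token_positions(NUM_PROMPTS, TOKENS_PER_PROMPT)
active = estimated_active_positions(total, ACTIVATION_DENSITY)

print(f"total token positions: {total:,}")
print(f"estimated active positions: {active:,}")
```

Under that assumption, the feature would fire on roughly 21,500 of about 3.1 million token positions.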