INDEX

Explanations

references to personal pronouns, particularly in the context of relationships and interactions

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

agr

-0.07

acro

-0.07

idia

-0.07

ÄįÃŃ

-0.06

oz

-0.06

ÐµÑĢÐº

-0.06

lj

-0.06

ROUGH

-0.06

 vÃ¥r

-0.06

wy

-0.06

POSITIVE LOGITS

/us

0.10

ompiler

0.08

zelf

0.07

ķ

0.07

 Norm

0.06

Buzz

0.06

/her

0.06

Ã¡Å¾

0.06

setIcon

0.06

 throughout

0.06

Activations Density 0.028%