Explanations

references to personal pronouns and possessive forms of "he" and "his."

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Negative Logits

Token          Logit
reeNode        -0.08
ogs            -0.07
omik           -0.07
lasses         -0.07
¶Į             -0.07
eln            -0.07
кові           -0.07
евид           -0.07
ertainment     -0.07
usercontent    -0.07

Positive Logits

Token    Logit
or       0.10
/her     0.08
/she     0.06
onom     0.06
彼女     0.06
(.)      0.06
sil      0.06
she      0.06
yo       0.06
osph     0.06

Activations Density 0.004%
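
To put the density figure in context, here is a rough sketch that combines it with the prompt configuration listed above. It assumes, without confirmation from this page, that the density is the fraction of dashboard tokens on which the feature has a nonzero activation. The variable names are illustrative only.

```python
# Figures copied from the Configuration and Activations Density entries.
NUM_PROMPTS = 24_576
TOKENS_PER_PROMPT = 128
ACTIVATION_DENSITY_PERCENT = 0.004

# Total number of token positions covered by the dashboard prompts.
total_tokens = NUM_PROMPTS * TOKENS_PER_PROMPT

# Assumption: density is the share of those positions with a nonzero activation.
expected_active_tokens = total_tokens * ACTIVATION_DENSITY_PERCENT / 100

print(f"total tokens: {total_tokens:,}")
print(f"approx. activating tokens: {expected_active_tokens:.0f}")
```

Under that assumption, the feature fires on roughly 126 of about 3.1 million tokens, which is consistent with a very sparse feature.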