INDEX

Explanations

expressions related to responsibility and consequences

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Negative Logits

iver

-0.07

 Prem

-0.07

بان

-0.07

Prem

-0.07

(dead

-0.06

Rac

-0.06

oproject

-0.06

?}",

-0.06

roof

-0.06

udit

-0.06

Positive Logits

opia

0.06

illo

0.06

 retro

0.06

 overseas

0.06

Ab

0.06

 Country

0.06

uru

0.06

IfNeeded

0.06

 foreign

0.06

Activations Density 0.008%