INDEX

Explanations

phrases that indicate responses to medical treatments or conditions

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

typed

-0.07

IGENCE

-0.07

_typ

-0.07

opus

-0.07

urge

-0.07

amarin

-0.07

unda

-0.07

pone

-0.07

á»§ng

-0.07

evi

-0.07

POSITIVE LOGITS

brief

0.06

 Ø§ÙĨØª

0.06

KK

0.06

ASE

0.05

\Bundle

0.05

 cheers

0.05

Pes

0.05

UserDefaults

0.05

gag

0.05

 humid

0.05

Activations Density 0.007%