INDEX

Explanations

terms related to allegations of misconduct and unethical behavior

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

+offset

-0.08

okable

-0.07

abcdefghijklmnop

-0.07

ezi

-0.07

duk

-0.07

é¼

-0.07

à¸²à¸°

-0.07

zcze

-0.07

Ð¾Ð²Ð¸

-0.07

Ãªu

-0.07

POSITIVE LOGITS

bul

0.06

ences

0.06

 communication

0.06

ler

0.06

cl

0.06

isle

0.06

ires

0.06

ana

0.06

Activations Density 0.016%