INDEX

Explanations

patterns or structures in formatted data or code

Configuration

Prompts (Dashboard): 24,576 prompts, 128 tokens each

Dataset (Dashboard): cerebras/SlimPajama-627B

Negative Logits

Token       Logit
alls        -0.08
å³          -0.07
oog         -0.07
 YÃĸ        -0.07
ober        -0.07
Cyr         -0.07
 âĨĴ↵↵      -0.07
 eskort     -0.07
 favour     -0.06
ÎµÏħ        -0.06

Positive Logits

Token       Logit
ë²Į         0.06
avy         0.06
.digital    0.06
_exempt     0.06
ro          0.06
(no token text shown)  0.05
ankan       0.05
Violation   0.05
.Encode     0.05
vik         0.05

Activations Density 0.013%
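
To put the density figure in context, the sketch below combines it with the prompt configuration listed above. It assumes that "Activations Density" means the fraction of sampled tokens on which the feature has a nonzero activation; the page itself does not define the term. The function names are illustrative only.

```python
# Figures quoted on the dashboard.
NUM_PROMPTS = 24_576
TOKENS_PER_PROMPT = 128
ACTIVATION_DENSITY = 0.013 / 100  # 0.013% expressed as a fraction


def total_tokens(num_prompts: int, tokens_per_prompt: int) -> int:
    """Number of tokens in the dashboard sample."""
    return num_prompts * tokens_per_prompt


def expected_active_tokens(n_tokens: int, density: float) -> float:
    """Approximate token count with nonzero activation.

    Assumption: density is the fraction of tokens that activate the feature.
    """
    return n_tokens * density


if __name__ == "__main__":
    n = total_tokens(NUM_PROMPTS, TOKENS_PER_PROMPT)
    active = expected_active_tokens(n, ACTIVATION_DENSITY)
    print(f"tokens in dashboard sample: {n:,}")
    print(f"approx. tokens with nonzero activation: {active:.0f}")
```

Under that reading, the feature fires on roughly four hundred of about three million sampled tokens, which is consistent with a sparse feature.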