INDEX

Explanations

elements related to violent or disturbing themes

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

ÐµÐ½ÑĮ

-0.06

NXT

-0.06

 Worth

-0.06

 Eating

-0.06

vej

-0.06

_BB

-0.06

agine

-0.06

ror

-0.05

erv

-0.05

 Guerrero

-0.05

POSITIVE LOGITS

Cas

0.09

cas

0.09

kaz

0.08

Cas

0.07

CAS

0.07

ombat

0.07

 Morocco

0.07

cas

0.07

CAS

0.07

ï¸ı

0.07

Activations Density 0.002%