INDEX

Explanations

references to violent crimes and assaults

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

ugas

-0.07

esser

-0.07

ocale

-0.07

rikes

-0.06

 commission

-0.06

alia

-0.06

osc

-0.06

arkin

-0.06

ushman

-0.06

inka

-0.06

POSITIVE LOGITS

bk

0.07

_brightness

0.06

 heck

0.06

#",

0.06

TRGL

0.06

-transitional

0.06

RIORITY

0.06

-LAST

0.06

 ../../

0.06

heck

0.06

Activations Density 0.016%