Explanations

negative assertions or statements of refusal

Configuration

Prompts (Dashboard): 24,576 prompts, 128 tokens each

Dataset (Dashboard): cerebras/SlimPajama-627B

Negative Logits

"ikel"        -0.07
"amber"       -0.07
"elman"       -0.06
"oog"         -0.06
"agos"        -0.06
"éric"        -0.06
" Integral"   -0.06
"osu"         -0.06
"_Clear"      -0.06
"_pg"         -0.06

Positive Logits

" conventional"   0.08
" conform"        0.07
".Exception"      0.06
" convention"     0.06
" clutter"        0.06
"use"             0.06
" touching"       0.06
" conforms"       0.06
"ash"             0.06

Activations Density 0.044%
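
As a rough sanity check, the density figure can be turned into an approximate token count. The sketch below assumes the density was measured over the dashboard prompts listed under Configuration (24,576 prompts of 128 tokens each); the page does not state this explicitly, and the function and variable names are illustrative only.

```python
# Assumption: the activation density is measured over the dashboard prompts
# (24,576 prompts x 128 tokens). The page does not confirm this.
PROMPTS = 24_576
TOKENS_PER_PROMPT = 128
DENSITY_PERCENT = 0.044


def estimated_active_tokens(prompts, tokens_per_prompt, density_percent):
    """Return (total tokens, estimated number of tokens where the feature fires)."""
    total = prompts * tokens_per_prompt
    return total, total * density_percent / 100.0


total_tokens, active_tokens = estimated_active_tokens(
    PROMPTS, TOKENS_PER_PROMPT, DENSITY_PERCENT
)
print(f"{total_tokens:,} tokens in total")
print(f"about {active_tokens:,.0f} tokens with a nonzero activation")
```

Under that assumption, the feature would fire on roughly 1,384 of 3,145,728 tokens, which fits a sparse feature that responds to a narrow pattern such as refusals or negative assertions.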