INDEX

Explanations

concepts related to ethical reasoning and judgments

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

piler

-0.06

gg

-0.06

ÑĢÐ°Ð²Ð¸

-0.06

elong

-0.06

 https

-0.06

oucher

-0.06

imb

-0.06

pla

-0.06

 elong

-0.06

oulos

-0.06

POSITIVE LOGITS

(es

0.08

Ä°S

0.07

âĢĲ

0.07

'n

0.07

 eskort

0.07

uniacid

0.06

\xaa

0.06

activex

0.06

BadRequest

0.06

'].'

0.06

Activations Density 0.000%