Explanations

terms and phrases related to human rights

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Negative Logits

Token          Logit
"nder"         -0.07
"oe"           -0.07
"ointment"     -0.06
"umas"         -0.06
"oo"           -0.06
".Lookup"      -0.06
"Bes"          -0.06
"iquid"        -0.06
"semblies"     -0.06
"ilm"          -0.06

Positive Logits

Token            Logit
" violation"     0.09
" violations"    0.08
"viol"           0.08
"esktop"         0.07
" viol"          0.07
" Viol"          0.07
"kova"           0.07
"øre"            0.07
"PUS"            0.07
"Violation"      0.07

Activations Density 0.008%
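
To give the density figure a sense of scale, the short sketch below converts it into an approximate token count. It rests on an assumption the page does not state: that the density is the fraction of tokens in the listed prompt sample (24,576 prompts of 128 tokens each) on which the feature activates. The reported 0.008% is rounded, so the result is only an estimate, and the variable names are illustrative.

```python
# Figures taken from the dashboard configuration and density line.
NUM_PROMPTS = 24_576
TOKENS_PER_PROMPT = 128
ACTIVATION_DENSITY_PERCENT = 0.008  # reported value, already rounded

total_tokens = NUM_PROMPTS * TOKENS_PER_PROMPT

# Assumption: density = activating tokens / total tokens in this sample.
approx_activating_tokens = total_tokens * ACTIVATION_DENSITY_PERCENT / 100

print(f"total tokens: {total_tokens:,}")
print(f"approx. activating tokens: {approx_activating_tokens:.0f}")
```

Under that assumption the feature fires on roughly 250 of about 3.1 million tokens, which fits a sparse feature tied to a narrow topic.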