Explanations

references to violence and abuse against vulnerable populations

Configuration

Prompts (Dashboard): 24,576 prompts, 128 tokens each
Dataset (Dashboard): cerebras/SlimPajama-627B

Negative Logits

Token           Logit
" thuis"        -0.06
"aters"         -0.06
"ollen"         -0.06
"OOSE"          -0.06
"nnen"          -0.06
"arks"          -0.06
"退"            -0.06
" Cord"         -0.06
" stockings"    -0.06
"Shr"           -0.06

Positive Logits

Token           Logit
"ulumi"         0.07
" camp"         0.07
"EU"            0.07
" Animalia"     0.07
"ilib"          0.06
"dou"           0.06
"acman"         0.06
"ชาย"           0.06
" Morrison"     0.06
" Herman"       0.06

Activations Density: 0.009%
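
To give the density figure a sense of scale, here is a rough sketch that combines it with the prompt counts from the configuration. It assumes, and this is only an assumption, that the density is the fraction of all dashboard tokens on which the feature has a nonzero activation. The function names are hypothetical, and because the reported 0.009% is rounded, the result is only an approximate count.

```python
# Figures copied from the configuration and density lines of this page.
NUM_PROMPTS = 24_576
TOKENS_PER_PROMPT = 128
DENSITY_PERCENT = 0.009  # reported (rounded) activation density


def total_tokens(num_prompts: int, tokens_per_prompt: int) -> int:
    """Total number of token positions covered by the dashboard prompts."""
    return num_prompts * tokens_per_prompt


def estimated_active_tokens(total: int, density_percent: float) -> float:
    """Approximate count of token positions where the feature fires.

    Assumption: density = active token positions / total token positions.
    """
    return total * density_percent / 100.0


total = total_tokens(NUM_PROMPTS, TOKENS_PER_PROMPT)
active = estimated_active_tokens(total, DENSITY_PERCENT)

print(f"total token positions: {total}")
print(f"estimated active positions: {round(active)}")
```

Under that assumption the feature would fire on only a few hundred of roughly three million token positions, which fits a very sparse feature.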