INDEX

Explanations

phrases indicating criticism or concern about social issues and injustices

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

Verdana

-0.08

ILA

-0.07

.appspot

-0.07

ouns

-0.07

Î¼ÏĨ

-0.07

acter

-0.07

ISCO

-0.07

ila

-0.07

prung

-0.06

_initializer

-0.06

POSITIVE LOGITS

å¦ĤæŃ¤

0.09

ç«Ł

0.07

 such

0.07

 regress

0.07

so

0.07

akit

0.07

 while

0.06

 modern

0.06

 basic

0.06

 grown

0.06

Activations Density 0.024%