INDEX

Explanations

phrases that indicate social dynamics or inequality

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

à¤¾à¤®à¤Ĺ

-0.07

itoris

-0.07

alte

-0.06

hausen

-0.06

bÃ©

-0.06

itori

-0.06

 didFinish

-0.06

ouses

-0.06

loadModel

-0.06

 Sexe

-0.06

POSITIVE LOGITS

 applicable

0.08

.apply

0.07

 Apply

0.07

Apply

0.07

 occurrences

0.07

 apply

0.07

Occ

0.07

 Applies

0.07

apply

0.07

 Ð¿ÑĢÐ¸Ð¼ÐµÐ½

0.07

Activations Density 0.044%