INDEX

Explanations

words and phrases indicating power dynamics in social contexts

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

TION

-0.08

 therefore

-0.07

sti

-0.07

atu

-0.07

iyon

-0.06

 dann

-0.06

æĽ

-0.06

xed

-0.06

Ãºs

-0.06

stre

-0.06

POSITIVE LOGITS

 there

0.07

isos

0.07

ugen

0.07

icos

0.06

 most

0.06

avy

0.06

zeit

0.06

Æ°a

0.06

there

0.06

ppard

0.06

Activations Density 0.026%