INDEX

Explanations

themes related to self-actualization and personal identity

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 genders

-0.06

neutral

-0.06

wy

-0.06

zac

-0.06

oxel

-0.05

 oath

-0.05

andering

-0.05

ritten

-0.05

archs

-0.05

POSITIVE LOGITS

Ð¼Ð¾ÑĤ

0.08

 security

0.07

 motivational

0.07

Security

0.07

 levels

0.07

ìļķ

0.07

 Security

0.06

 discrepan

0.06

Mot

0.06

 stages

0.06

Activations Density 0.003%