INDEX

Explanations

expressions of moral judgment and accountability in social contexts

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

rey

-0.08

agi

-0.08

edb

-0.08

å¾Ĵ

-0.07

Î´Î¬

-0.07

ainment

-0.07

UNUSED

-0.07

ÄįnÃ©

-0.07

/Branch

-0.07

-Disposition

-0.07

POSITIVE LOGITS

 wanting

0.08

 concern

0.08

 considering

0.07

ant

0.07

 skepticism

0.07

 concerns

0.06

 wants

0.06

ulo

0.06

 concluded

0.06

 limited

0.06

Activations Density 0.026%