INDEX

Explanations

references to social responsibility and advocacy for marginalized groups

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

anel

-0.08

eteria

-0.08

exus

-0.07

uent

-0.07

Scalars

-0.07

asar

-0.06

ledged

-0.06

otify

-0.06

rzy

-0.06

-basket

-0.06

POSITIVE LOGITS

 responsibility

0.08

 responsible

0.08

 Responsibility

0.07

 society

0.07

 communities

0.07

 responsibilities

0.06

 ourselves

0.06

ÏĦÎ±Î¹

0.06

 Responsibilities

0.06

Activations Density 0.034%