INDEX

Explanations

phrases related to accountability and responsibility in news contexts

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

het

-0.06

builtin

-0.06

ifes

-0.06

sled

-0.06

aukee

-0.06

.od

-0.06

 Tata

-0.06

ayla

-0.06

aut

-0.06

ikk

-0.06

POSITIVE LOGITS

TS

0.09

tsx

0.09

ts

0.08

(ts

0.08

TS

0.07

ÙħØ²

0.07

RING

0.07

IDO

0.07

 Jennings

0.07

ï»¿↵↵

0.06

Activations Density 0.001%