Explanations

statements about intent and meaning in communication

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Negative Logits

Token       Logit
"ồi"        -0.07
"LETE"      -0.07
"abbage"    -0.07
"iken"      -0.06
"ropy"      -0.06
"ken"       -0.06
"achten"    -0.06
"şt"        -0.06
"Ken"       -0.06
"aze"       -0.06

Positive Logits

Token            Logit
" meant"         0.11
" offense"       0.10
" offence"       0.10
" intended"      0.08
" disrespect"    0.08
"冒"             0.08
" mean"          0.08
" offend"        0.07
"ynet"           0.07
" Mean"          0.07

Activations Density 0.028%
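
To give the density figure some scale, the short sketch below converts it into an approximate token count. It assumes the 0.028% density is measured over the dashboard prompt set listed under Configuration (24,576 prompts of 128 tokens each); the page does not state this explicitly, so treat the result as a rough estimate. The function name `estimate_active_tokens` is hypothetical and used only for illustration.

```python
# Figures copied from the Configuration and Activations Density lines.
NUM_PROMPTS = 24_576
TOKENS_PER_PROMPT = 128
ACTIVATION_DENSITY_PERCENT = 0.028


def estimate_active_tokens(num_prompts, tokens_per_prompt, density_percent):
    """Return (total tokens, estimated tokens with nonzero activation).

    Assumption: the density is the share of all dashboard tokens on which
    the feature fires. The source page does not confirm this definition.
    """
    total = num_prompts * tokens_per_prompt
    active = total * density_percent / 100
    return total, active


total_tokens, active_tokens = estimate_active_tokens(
    NUM_PROMPTS, TOKENS_PER_PROMPT, ACTIVATION_DENSITY_PERCENT
)
print(f"total tokens: {total_tokens:,}")
print(f"estimated active tokens: {active_tokens:.0f}")
```

Under that assumption, the feature fires on fewer than a thousand of roughly three million tokens, which fits a narrow feature tied to phrases such as "no offense meant".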