INDEX

Explanations

phrases that express suggestions or improvement related to processes and conditions

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

vÃŃc

-0.07

tÄĽÅ¾

-0.07

atee

-0.07

bies

-0.07

_fu

-0.07

gnore

-0.07

(æĹ¥

-0.07

 prostitu

-0.07

lio

-0.07

semicolon

-0.07

POSITIVE LOGITS

ingly

0.10

etheless

0.10

uably

0.08

 kidding

0.08

umably

0.08

beit

0.07

pecially

0.07

icularly

0.07

ally

0.07

arily

0.07

Activations Density 0.407%