
Explanations

phrases that suggest reasoning or justification


Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Negative Logits

"zimmer"    -0.07
"lem"       -0.07
"タル"      -0.06
"яб"        -0.06
"ients"     -0.06
"ERSIST"    -0.06
" Resist"   -0.06
"нят"       -0.06
"Gry"       -0.06
"acia"      -0.06

Positive Logits

" therefore"     0.31
" Therefore"     0.25
"Therefore"      0.24
" donc"          0.19
"，所以"          0.18
" thus"          0.18
" daher"         0.18
" accordingly"   0.18
" hence"         0.17
" więc"          0.17

Activations Density 0.073%
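As a rough sense of scale, the density can be combined with the prompt configuration listed earlier. This is a back-of-the-envelope sketch that assumes the density is the fraction of token positions with a nonzero activation and that it was measured over the same 24,576 prompts of 128 tokens; the page does not confirm either assumption.

```python
# Values taken from the Configuration and Activations Density entries.
N_PROMPTS = 24_576
TOKENS_PER_PROMPT = 128
DENSITY_PERCENT = 0.073

# Total token positions in the dashboard prompt set.
total_tokens = N_PROMPTS * TOKENS_PER_PROMPT

# Assumption: density = active positions / total positions.
expected_active = total_tokens * DENSITY_PERCENT / 100

print(f"total tokens:     {total_tokens:,}")
print(f"expected active:  {round(expected_active):,}")
```

Under these assumptions the feature would fire on roughly 2,300 of about 3.1 million token positions.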