INDEX

Explanations

formal statements regarding guidelines or policies

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

abar

-0.07

 unle

-0.06

Ø§Ø±Ùĩ

-0.06

_flutter

-0.06

aber

-0.06

ä¾¯

-0.06

obb

-0.06

.Shared

-0.05

umen

-0.05

Ste

-0.05

POSITIVE LOGITS

 precedence

0.10

çŁ

0.08

uhe

0.07

trak

0.07

 contradictions

0.07

ogne

0.07

-vars

0.07

_CAPACITY

0.06

 Conflict

0.06

 contradiction

0.06

Activations Density 0.002%