INDEX

Explanations

instances or occurrences of actions and events, particularly those related to experiences

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Negative Logits

phere

-0.07

ackle

-0.07

ěž

-0.07

chw

-0.07

_TAC

-0.07

iddy

-0.07

polator

-0.07

 artık

-0.07

quip

-0.06

نویس

-0.06

Positive Logits

 several

0.16

 occasionally

0.16

 twice

0.15

 sometimes

0.15

 occasions

0.13

 often

0.12

 instances

0.12

sometimes

0.12

 Twice

0.12

 frequently

0.12

Activations Density 0.162%
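
For a sense of scale, the prompt configuration and the activation density can be combined into a rough count of activating tokens. The sketch below assumes that the density is the fraction of all dashboard tokens on which the feature fires and that it was measured over the same 24,576 prompts; the page states neither, so treat the result as an estimate. The constant and function names are illustrative only.

```python
# Figures copied from the dashboard configuration above.
NUM_PROMPTS = 24_576
TOKENS_PER_PROMPT = 128
ACTIVATION_DENSITY_PERCENT = 0.162


def total_tokens(num_prompts: int, tokens_per_prompt: int) -> int:
    """Total number of token positions across all dashboard prompts."""
    return num_prompts * tokens_per_prompt


def estimated_active_tokens(total: int, density_percent: float) -> int:
    """Approximate count of token positions where the feature is active.

    Assumption: density is a per-token fraction over the same prompt set.
    """
    return round(total * density_percent / 100)


total = total_tokens(NUM_PROMPTS, TOKENS_PER_PROMPT)
active = estimated_active_tokens(total, ACTIVATION_DENSITY_PERCENT)
print(f"total tokens: {total:,}")
print(f"estimated activating tokens: {active:,}")
```

Under these assumptions the feature is sparse: only a few thousand of roughly three million token positions would activate it.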