INDEX

Explanations

phrases indicating initial reactions or responses to various events or actions

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

gota

-0.07

celik

-0.06

emek

-0.06

cek

-0.06

campo

-0.06

ationship

-0.06

icari

-0.06

Ø§Ø³Ùħ

-0.06

ftware

-0.06

terdam

-0.06

POSITIVE LOGITS

aph

0.07

own

0.07

->{'

0.07

mixed

0.07

bage

0.07

rement

0.06

Åº

0.06

gre

0.06

vol

0.06

zen

0.06

Activations Density 0.006%