INDEX

Explanations

rhetorical questions expressing disbelief or emphasis

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Negative Logits

"edith"     -0.07
"loub"      -0.07
"estion"    -0.07
"oleon"     -0.07
"stash"     -0.07
"ÑĥÑĩÐ°"    -0.07
"sic"       -0.07
"ĶĶ"        -0.06
" forth"    -0.06
"_PID"      -0.06

Positive Logits

"rez"             0.07
" arrow"          0.06
"bs"              0.06
"agh"             0.06
"ault"            0.06
"pek"             0.06
" Freed"          0.06
"izo"             0.06
" interesting"    0.06
(token text missing)    0.05
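Some tokens in the logit lists (for example ÑĥÑĩÐ° and ĶĶ) look like encoding damage, but they are consistent with the printable byte alphabet that GPT-2-style byte-level BPE tokenizers use to display raw bytes. The page does not name the model or tokenizer, so treat that as an assumption. Under it, the sketch below rebuilds the byte table and maps a displayed token back to text; `decode_token` is a hypothetical helper name, not part of any dashboard API.

```python
def bytes_to_unicode() -> dict[int, str]:
    """Byte -> printable character table of GPT-2-style byte-level BPE.

    Assumption: the dashboard's tokenizer uses this convention.
    """
    printable = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(ord("¡"), ord("¬") + 1))
        + list(range(ord("®"), ord("ÿ") + 1))
    )
    table = {b: chr(b) for b in printable}
    shift = 0
    for b in range(256):
        if b not in table:
            # Non-printable bytes are shifted to code points from U+0100 up.
            table[b] = chr(256 + shift)
            shift += 1
    return table


UNICODE_TO_BYTE = {ch: b for b, ch in bytes_to_unicode().items()}


def decode_token(display: str) -> str:
    """Map a displayed token back to text (hypothetical helper)."""
    raw = bytearray()
    for ch in display:
        if ch in UNICODE_TO_BYTE:
            raw.append(UNICODE_TO_BYTE[ch])
        else:
            # Characters the page already rendered (e.g. a literal space).
            raw.extend(ch.encode("utf-8"))
    # Tokens that hold only part of a multi-byte character cannot be
    # decoded on their own; those bytes become U+FFFD.
    return raw.decode("utf-8", errors="replace")


print(decode_token("ÑĥÑĩÐ°"))
print(repr(decode_token("ĶĶ")))
```

Under this assumption the first token decodes to Cyrillic text, while ĶĶ is a pair of UTF-8 continuation bytes with no standalone rendering, which is why it has to be shown in raw form.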

Activations Density 0.006%
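To put the density figure in context, here is a rough back-of-the-envelope sketch. It assumes the density is the share of token positions in the dashboard prompts (24,576 prompts of 128 tokens each) on which the feature is active; the page does not define the metric, and the displayed percentage is rounded, so the result is only an estimate.

```python
# Figures taken from the dashboard page.
NUM_PROMPTS = 24_576
TOKENS_PER_PROMPT = 128
DENSITY_PERCENT = 0.006  # rounded value as displayed

# Total token positions covered by the dashboard prompts.
total_tokens = NUM_PROMPTS * TOKENS_PER_PROMPT

# Assumption: density = active token positions / total token positions.
expected_active = total_tokens * DENSITY_PERCENT / 100

print(f"total tokens: {total_tokens:,}")
print(f"estimated active positions: {expected_active:.0f}")
```

On that reading the feature fires on roughly a couple of hundred positions out of about three million, which fits a sparse, narrowly scoped feature.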