INDEX

Explanations

elements and their attributes in HTML code

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Negative Logits

ussen

-0.09

ffd

-0.08

ngine

-0.07

 Hüs

-0.07

ahy

-0.07

áy

-0.07

‌ان

-0.07

Ｍ

-0.07

ItemAt

-0.07

acia

-0.07

Positive Logits

ar

0.07

0.06

ropa

0.06

ince

0.06

inst

0.05

ome

0.05

gen

0.05

 persuasion

0.05

Activations Density 0.007%
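
To put the density figure in context, the sketch below combines it with the prompt configuration listed above. It assumes the density is the fraction of all tokens in the prompt set on which the feature fires; the dashboard does not state this definition. The displayed 0.007% is rounded, so the result is only an estimate. The function names are illustrative, not part of any tool.

```python
# Figures copied from the dashboard fields above.
NUM_PROMPTS = 24_576
TOKENS_PER_PROMPT = 128
ACTIVATION_DENSITY_PERCENT = 0.007  # rounded value as displayed


def total_tokens(num_prompts: int, tokens_per_prompt: int) -> int:
    """Total number of token positions in the prompt set."""
    return num_prompts * tokens_per_prompt


def estimated_active_tokens(total: int, density_percent: float) -> float:
    """Estimated count of positions where the feature is active.

    Assumption: density is (active positions / total positions) * 100.
    """
    return total * density_percent / 100.0


total = total_tokens(NUM_PROMPTS, TOKENS_PER_PROMPT)
active = estimated_active_tokens(total, ACTIVATION_DENSITY_PERCENT)

print(f"total tokens: {total:,}")
print(f"estimated active tokens: {active:.0f}")
```

Under that assumption the feature fires on roughly 220 of about 3.1 million token positions, which is consistent with a sparse, narrowly scoped feature.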