INDEX

Explanations

words related to titles and headings in the document

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

arily

-0.07

ensch

-0.07

zik

-0.07

cul

-0.07

ekk

-0.07

-0.06

erland

-0.06

ritz

-0.06

antro

-0.06

æ·»

-0.06

POSITIVE LOGITS

=title

0.08

_singular

0.07

ãĥ³ãĥĩ

0.07

 Robbins

0.06

wargs

0.06

/head

0.06

-area

0.06

-less

0.06

arda

0.06

orous

0.06

Activations Density 0.015%