INDEX

Explanations

Architecture

New Auto-Interp

Configuration

Prompts (Dashboard)

16,384 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 architecture

-0.95

architecture

-0.78

+#+#

-0.78

 cherchés

-0.73

 Shakspeare

-0.70

AutoScale

-0.70

 purpoſe

-0.68

verwijspagina

-0.66

ंदीखरीदारी

-0.65

 ivelany

-0.62

POSITIVE LOGITS

LoS

0.54

"""

0.48

 aimé

0.47

def

0.47

 vendus

0.47

deviantart

0.46

LEMMA

0.45

 fVar

0.45

of

0.44

on

0.44

Activations Density 0.118%