INDEX

Explanations

eagerly

New Auto-Interp

Configuration

Prompts (Dashboard)

16,384 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

Efq

-0.88

ReusableCell

-0.72

 Houſe

-0.72

 itſelf

-0.66

 myſelf

-0.66

 houſe

-0.66

 Monfieur

-0.64

wpi

-0.63

ſelf

-0.63

 raiſ

-0.63

POSITIVE LOGITS

 import

0.46

ActionCreators

0.44

 experiment

0.44

store

0.43

bre

0.43

悦

0.42

chtenstein

0.42

en

0.41

times

0.41

ubro

0.40

Activations Density 0.006%