INDEX

Explanations

für

New Auto-Interp

Configuration

Prompts (Dashboard)

16,384 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

usermodel

-0.66

rawDesc

-0.64

LookAnd

-0.63



-0.60

maphore

-0.57

elemField

-0.57

umburg

-0.56

__*/

-0.56

 unknownFields

-0.56

zenta

-0.56

POSITIVE LOGITS

+#+#

0.58

BASELINE

0.46

MIDDLEWARE

0.45

uevos

0.44

 SwitchCompat

0.42

μι

0.40

 strategy

0.39

◄

0.39

PhysRevLett

0.39

vertret

0.39

Activations Density 0.001%