INDEX

Explanations

Ernst Ferdinand | what we | Q: | Our training

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

拜

-0.89

 واحد

-0.85

 tài

-0.81

Ẻ

-0.81

dump

-0.76

Confira

-0.75

uggles

-0.73

 Tài

-0.73

大学生

-0.72

yendo

-0.71

POSITIVE LOGITS

currentPosition

0.84

 verder

0.84

 Shaw

0.77

anter

0.77

ógicos

0.75

 stuks

0.75

Fc

0.74

 enligt

0.74

anic

0.73

まして

0.73

Activations Density 0.010%