INDEX

Explanations

code filenames and imports

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 then

-1.87

 mismos

-1.70

ſſen

-1.63

がこの

-1.61

it

-1.60

 этом

-1.59

側の

-1.59

 this

-1.58

 rano

-1.58

 kuchen

-1.56

POSITIVE LOGITS

 даже

2.06

Spesifikasi

2.03

Pengertian

1.99

 Even

1.95

 Often

1.77

Even

1.77

 حتی

1.77

 ответить

1.75

мл

1.75

 When

1.74

Activations Density 0.028%