Explanations

script with, silhouette with

Configuration

Prompts (Dashboard)

392,802 prompts, 256 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Negative Logits

Token         Logit
 tandis       1.32
이며          1.19
։             1.19
。.           1.18
 sedangkan    1.18
 Sedangkan    1.18
。\           1.17
ค่ะ           1.16
。...         1.16
သည်။          1.12

Positive Logits

Token          Logit
—              0.88
 merely        0.87
 simply        0.87
not            0.87
 essentially   0.85
 basically     0.83
 unwittingly   0.80
 astray        0.80
 somehow       0.80
 repeatedly    0.79

Activations Density 0.204%
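
To give a sense of scale for the Activations Density figure, the sketch below estimates how many tokens the feature would fire on. It assumes the density is measured over the same dashboard prompt set listed under Configuration (392,802 prompts of 256 tokens each); the page does not state this, so treat the result as a rough illustration. The function name `estimate_active_tokens` is hypothetical.

```python
# Figures copied from the dashboard above.
NUM_PROMPTS = 392_802
TOKENS_PER_PROMPT = 256
ACTIVATION_DENSITY = 0.204 / 100  # 0.204% expressed as a fraction


def estimate_active_tokens(num_prompts, tokens_per_prompt, density):
    """Return (total tokens, estimated tokens with non-zero activation).

    Assumption: the density is computed over this same prompt set.
    """
    total = num_prompts * tokens_per_prompt
    return total, round(total * density)


total_tokens, active_tokens = estimate_active_tokens(
    NUM_PROMPTS, TOKENS_PER_PROMPT, ACTIVATION_DENSITY
)
print(f"total tokens: {total_tokens:,}")
print(f"estimated active tokens: {active_tokens:,}")
```

Under that assumption, the feature is active on roughly one token in every 490.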