INDEX

Explanations

don't be something negative

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 that

-2.36

it

-2.31

甞

-2.11

过年

-1.95

羕

-1.92

៚

-1.85

ꯠ

-1.81

 veicolo

-1.81

when

-1.80

ではでは

-1.77

POSITIVE LOGITS

 هیچ

1.73

1.57

fri

1.55

はいけない

1.54

 بعض

1.51

 những

1.49

朽

1.48

<bos>

1.46

})=\

1.45

 chill

1.45

Activations Density 0.007%