INDEX

Explanations

specific contexts or titles

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

buti

-0.82

cerer

-0.78

 видел

-0.78

 Mondays

-0.77

dungen

-0.77

кожа

-0.77

化する

-0.76

 reagieren

-0.75

 género

-0.74

nase

-0.74

POSITIVE LOGITS

 Silver

0.92

る

0.88

Silver

0.84

 totes

0.83

мал

0.82

フィー

0.80

 транспорт

0.80

 princes

0.79

anță

0.77

inos

0.76

Activations Density 0.017%