Explanations

the exact word "Den" (capital D) in the text.

Configuration

Prompts (Dashboard): 16,384 prompts, 128 tokens each

Dataset (Dashboard): monology/pile-uncopyrighted

Negative Logits

 myſelf       -1.89
 itſelf       -1.80
Efq           -1.79
 Monfieur     -1.70
 Theſe        -1.66
 themſelves   -1.65
 pleaſure     -1.65
 himſelf      -1.61
 Anſ          -1.60
 Houſe        -1.57

Positive Logits

de                1.00
del               0.90
en                0.85
[token missing]   0.81
[token missing]   0.77
und               0.76
di                0.75
par               0.74
des               0.74
du                0.74

Activations Density 0.002%
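
To give the density figure some scale, the sketch below combines it with the prompt counts listed under Configuration. It assumes the 0.002% density was measured over that same set of 16,384 prompts of 128 tokens each, which the page does not state explicitly; the variable and function names are illustrative only.

```python
# Figures copied from the dashboard text; the link between them is an assumption.
NUM_PROMPTS = 16_384
TOKENS_PER_PROMPT = 128
ACTIVATION_DENSITY_PERCENT = 0.002


def total_tokens(num_prompts: int, tokens_per_prompt: int) -> int:
    """Total number of token positions in the prompt set."""
    return num_prompts * tokens_per_prompt


def expected_active_tokens(total: int, density_percent: float) -> float:
    """Approximate count of token positions where the feature is active."""
    return total * density_percent / 100.0


total = total_tokens(NUM_PROMPTS, TOKENS_PER_PROMPT)
active = expected_active_tokens(total, ACTIVATION_DENSITY_PERCENT)

print(f"total token positions: {total}")
print(f"approximate active positions: {active:.1f}")
```

Under that assumption the feature would fire on roughly 42 of about 2.1 million token positions, which would make it a very sparse feature.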