Explanations

not

Configuration

Prompts (Dashboard): 16,384 prompts, 128 tokens each
Dataset (Dashboard): monology/pile-uncopyrighted

Negative Logits

Token               Logit
" GenerationType"   -0.76
"Getting"           -0.69
" Getting"          -0.68
"Adding"            -0.64
" Obtaining"        -0.63
"having"            -0.63
"getting"           -0.62
" EconPapers"       -0.62
" GETTING"          -0.62
" saites"           -0.60

Positive Logits

Token       Logit
"not"       1.54
"not"       1.05
"NOT"       0.89
"Not"       0.77
"Not"       0.74
" bukan"    0.68
"NOT"       0.67
" mitte"    0.61
" ikke"     0.60
"noty"      0.60

Activations Density 0.001%
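
To give a rough sense of scale for the density figure, the arithmetic below combines it with the dashboard configuration (16,384 prompts, 128 tokens each). This is only a sketch under two assumptions that the page itself does not state: that "Activations Density" means the fraction of tokens on which the feature is active, and that it was measured over this same prompt set. The variable and function names are illustrative.

```python
# Values taken from the dashboard configuration and density readout.
NUM_PROMPTS = 16_384
TOKENS_PER_PROMPT = 128
DENSITY_PERCENT = 0.001  # "Activations Density 0.001%"


def total_tokens(num_prompts: int, tokens_per_prompt: int) -> int:
    """Total number of tokens across all dashboard prompts."""
    return num_prompts * tokens_per_prompt


def approx_active_tokens(total: int, density_percent: float) -> float:
    """Approximate token count on which the feature is active.

    Assumes density is the fraction of tokens with a nonzero activation.
    """
    return total * density_percent / 100


total = total_tokens(NUM_PROMPTS, TOKENS_PER_PROMPT)
active = approx_active_tokens(total, DENSITY_PERCENT)

print(f"total tokens: {total:,}")
print(f"approx. active tokens: {active:.0f}")
```

Under those assumptions the feature would be active on roughly 20 of about 2.1 million tokens. Because the displayed percentage is rounded to one significant figure, the true count could differ noticeably from this estimate.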