Explanations

Conditionals starting with "would"

Configuration

Prompts (Dashboard): 24,576 prompts, 128 tokens each

Dataset (Dashboard): monology/pile-uncopyrighted

Negative Logits

Token              Logit
" will"            -2.42
"can"              -1.39
" when"            -1.29
" Traditionally"   -1.19
" nantinya"        -1.14
" ће"              -1.10
" jeśli"           -1.09
" unusually"       -1.08
" although"        -1.05
" When"            -1.05

Positive Logits

Token              Logit
" wouldn"          1.85
" would"           1.70
"wouldn"           1.48
" today"           1.38
" instead"         1.34
" serait"          1.30
"probably"         1.27
"Wouldn"           1.27
"Instead"          1.24
" wouldnt"         1.24

Activations Density 0.030%
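
To put the density figure in context, the short sketch below estimates how many tokens it corresponds to. It assumes (this is not stated on the page) that the 0.030% density is measured over the same dashboard prompt set of 24,576 prompts with 128 tokens each. The variable names are illustrative only.

```python
# Rough estimate of how many dashboard tokens activate this feature.
# Assumption: the activation density is measured over the dashboard
# prompt set (24,576 prompts of 128 tokens each); the page does not say so.
NUM_PROMPTS = 24_576
TOKENS_PER_PROMPT = 128
ACTIVATION_DENSITY = 0.030 / 100  # 0.030% written as a fraction

total_tokens = NUM_PROMPTS * TOKENS_PER_PROMPT
expected_active_tokens = total_tokens * ACTIVATION_DENSITY

print(f"total tokens: {total_tokens:,}")
print(f"approx. activating tokens: {expected_active_tokens:.0f}")
```

Under that assumption the feature fires on roughly one token in every 3,300, which fits a feature tied to a narrow set of tokens such as " would" and " wouldn".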