INDEX

Explanations

code assignment or variable names like `field` and `this`

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

[]={

-1.12

thmus

-1.10

,\,

-1.09

🫴

-1.04

bestos

-1.03

bited

-1.02

apnews

-1.02

回到了

-1.01

maining

-1.00

cleos

-1.00

POSITIVE LOGITS

1.81

}=\

1.09

set

1.06

jenigen

1.06

}=

1.05

one

1.04

 will

1.02

=\

1.02

块钱

0.99

>=</

0.98

Activations Density 0.001%