INDEX

Explanations

questioning self-worth or external doubt

New Auto-Interp

Configuration

Prompts (Dashboard)

392,802 prompts, 256 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

、（

1.14

 effectivement

1.06

 („

1.05

1.03

次

1.03

(*

0.97

*(

0.97

-(

0.92

เดียวกัน

0.90

gelijk

0.88

POSITIVE LOGITS

 untuk

1.67

 für

1.63

 của

1.58

<unused557>

1.56

 dari

1.54

of

1.52

 على

1.52

auf

1.51

<unused248>

1.51

for

1.51

Activations Density 0.551%