INDEX

Explanations

Guth/Gut/Gutt names and utter/utta

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits



-3.17

킁

-2.72

猻

-2.67

艄

-2.41

呶

-2.34

㍑

-2.34



-2.34

 doigt

-2.28

屺

-2.27

obviously

-2.23

POSITIVE LOGITS

↵

3.06

 When

2.69

外的

2.59

2.56

 prácticamente

2.39

 Other

2.30

 That

2.14

遢

2.13

</h1>

2.08

).

2.08

Activations Density 0.001%