INDEX

Explanations

speaking in first person

The neuron detects tokens where the author refers to themselves in the first person (e.g. “I,” “I’ve,” “I’m,” etc.).

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

Negative Logits

is

-2.61

</em>

-2.59

of

-2.45

has

-2.41

-2.33

-2.13

-2.03

</h2>

-2.03

had

-2.02

 then

-1.97

POSITIVE LOGITS

ἀ

2.16

睒

2.08

 Reverso

1.99

獁

1.99

‘‘

1.97

ፑ

1.96

 particules

1.94



1.93

 bors

1.91

瞜

1.91

Activations Density 0.023%