INDEX

Explanations

I/we need

The neuron fires on the first‐person singular pronoun “I,” i.e. spots where the author is speaking about themselves.

New Auto-Interp

Configuration

Prompts (Dashboard)

392,802 prompts, 256 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

Negative Logits

 commissioned

0.65

 periods

0.58

 resided

0.55

 inventive

0.55

 possessing

0.55

 enthusiasts

0.54

 whilst

0.54

Ћ

0.53

 enthusiast

0.52

 inhomogeneities

0.52

POSITIVE LOGITS

 usually

0.77

 সাধারণত

0.75

 Usually

0.73

usually

0.71

 biasanya

0.71

 Обычно

0.69

Usually

0.67

 Biasanya

0.67

 تقوم

0.66

 ALWAYS

0.66

Activations Density 0.282%