Explanations

personal states or feelings

This neuron fires on first-person expressions of annoyance, displeasure, or puzzlement (e.g. “pisses me off,” “bothers me,” “puzzles me”).

Configuration

Prompts (Dashboard): 24,576 prompts, 128 tokens each

Dataset (Dashboard): monology/pile-uncopyrighted

Negative Logits

Token               Logit
"for"               -1.30
" silencioso"       -1.07
" around"           -1.05
"on"                -1.04
"or"                -1.04
"and"               -1.02
"can"               -0.99
" with"             -0.96
" koş"              -0.95
" administrativo"   -0.93

Positive Logits

Token               Logit
" promin"           1.15
" SPOILERS"         1.14
"牘"                1.14
" והוא"             1.14
" apparaissent"     1.14
" intermin"         1.11
" ruines"           1.10
"γι"                1.10
"Cooperation"       1.09
"ebly"              1.08

Activations Density 0.034%
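
To put the density figure in context, it can be combined with the prompt configuration listed above. The sketch below assumes that activation density means the fraction of all dashboard tokens on which the neuron is active; the page does not define the term, so treat that reading, and the helper names used here, as illustrative assumptions. The reported 0.034% is also rounded, so the result is only an estimate.

```python
# Figures copied from the dashboard: 24,576 prompts of 128 tokens each,
# and a reported activation density of 0.034%.
NUM_PROMPTS = 24_576
TOKENS_PER_PROMPT = 128
DENSITY_PERCENT = 0.034


def total_tokens(num_prompts: int, tokens_per_prompt: int) -> int:
    """Total number of token positions in the dashboard sample."""
    return num_prompts * tokens_per_prompt


def estimated_active_tokens(total: int, density_percent: float) -> float:
    """Estimated count of tokens with a nonzero activation.

    Assumption (not stated on the page): density = active tokens / total tokens.
    """
    return total * density_percent / 100.0


total = total_tokens(NUM_PROMPTS, TOKENS_PER_PROMPT)
active = estimated_active_tokens(total, DENSITY_PERCENT)

print(f"total tokens: {total:,}")
print(f"estimated active tokens: {active:,.0f}")
```

Under that assumption the neuron is active on roughly a thousand of about 3.1 million tokens, which fits a feature tied to a narrow set of first-person phrases.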