Explanations

I and possessive pronouns

The neuron responds to words that signal personal, subjective expression: pronouns (“I”), modals (“would”), opinion verbs (“think,” “like”), and sentiment markers (“love,” “baby”).

Configuration

Prompts (Dashboard): 24,576 prompts, 128 tokens each

Dataset (Dashboard): monology/pile-uncopyrighted

Negative Logits

Token           Logit
" fellow"       -1.40
"ffion"         -1.18
"cznym"         -1.16
" erforder"     -1.15
" informée"     -1.13
"cestor"        -1.09
" decorados"    -1.09
"ário"          -1.09
" یا"           -1.08
"ℬ"             -1.07

Positive Logits

Token           Logit
"医药"          1.17
" himself"      1.17
"who"           1.14
"⋗"             1.11
" whom"         1.01
"だったが"      1.00
" immense"      0.99
"些"            0.98
" душу"         0.97
" again"        0.97

Activations Density 0.051%
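
To put the density figure in context, the sketch below combines it with the prompt configuration listed above. It assumes that "Activations Density" is the percentage of token positions in the dashboard prompts on which the neuron has a nonzero activation. The page does not state that definition, so the resulting count is only a rough estimate. The function names are illustrative and not part of any dashboard API.

```python
# Figures copied from the dashboard configuration and density line.
NUM_PROMPTS = 24_576
TOKENS_PER_PROMPT = 128
DENSITY_PERCENT = 0.051  # "Activations Density 0.051%"


def total_tokens(num_prompts: int, tokens_per_prompt: int) -> int:
    """Total token positions covered by the dashboard prompts."""
    return num_prompts * tokens_per_prompt


def estimated_active_tokens(total: int, density_percent: float) -> int:
    """Approximate number of token positions with a nonzero activation.

    Assumption: density is the share of all token positions that activate.
    """
    return round(total * density_percent / 100)


total = total_tokens(NUM_PROMPTS, TOKENS_PER_PROMPT)
active = estimated_active_tokens(total, DENSITY_PERCENT)
print(f"{total:,} token positions, roughly {active:,} with nonzero activation")
```

Under that assumption the neuron fires on only a small fraction of the sampled tokens, which fits a feature tied to a narrow set of words.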