Explanations

possessive pronouns followed by people

The neuron fires strongly on first-person, self-referential words (especially possessive forms such as “my” or “own”), flagging personal statements or experiences.

Configuration

Prompts (Dashboard)

392,802 prompts, 256 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

NEGATIVE LOGITS

"PRODUCT"  0.33
" पेशेवरों"  0.32
" জনসংখ্যার"  0.31
"汇总"  0.31
" Specifies"  0.30
" যাঁরা"  0.30
"*/"  0.30
"などの"  0.30
"に"  0.29
"Users"  0.29

POSITIVE LOGITS

" girlfriend"  0.96
" wife"  0.93
" fiancée"  0.91
" boyfriend"  0.89
" fiancé"  0.89
" grandmother"  0.88
" fiance"  0.83
" husband"  0.83
" roommate"  0.83
" niece"  0.82

Activations Density 1.195%
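
To give the density figure a sense of scale, the short sketch below combines it with the prompt configuration listed above. It rests on an assumption the page does not state: that the density is the fraction of token positions with a nonzero activation, measured over the same 392,802 prompts of 256 tokens each. The function name estimate_active_tokens is hypothetical and used only for illustration.

```python
# Figures copied from the dashboard configuration and density line.
NUM_PROMPTS = 392_802
TOKENS_PER_PROMPT = 256
ACTIVATION_DENSITY = 0.01195  # 1.195%


def estimate_active_tokens(num_prompts: int, tokens_per_prompt: int, density: float) -> tuple[int, int]:
    """Return (total token positions, estimated positions where the neuron is active).

    Assumption: density is the fraction of token positions with a nonzero
    activation over this exact prompt set. The dashboard does not confirm this.
    """
    total = num_prompts * tokens_per_prompt
    return total, round(total * density)


total_tokens, active_tokens = estimate_active_tokens(
    NUM_PROMPTS, TOKENS_PER_PROMPT, ACTIVATION_DENSITY
)
print(f"total token positions: {total_tokens:,}")
print(f"estimated active positions: {active_tokens:,}")
```

Under that assumption the neuron would be active on roughly 1.2 million of about 100 million token positions, which fits a feature that responds to a narrow class of contexts.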