INDEX

Explanations

positive sentiment closers

The neuron is triggered by polite, self‐referential or apologetic language—especially first‐person pronouns and words like “sorry,” “help,” “I,” “but,” and emphatic punctuation.

New Auto-Interp

Configuration

Prompts (Dashboard)

392,802 prompts, 256 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

Negative Logits

،

0.73

，

0.72

?",

0.67

?,

0.64

:'',

0.64

NAMEN

0.63

0.61

это

0.61

?',

0.60

،

0.60

POSITIVE LOGITS

 🤗

0.90

 🙂

0.88

 😊

0.87

 👍

0.86

:)

0.84

 😉

0.83

 💪

0.79

;)

0.78

 😎

0.77

 👌

0.74

Activations Density 0.359%