Explanations

in/on followed by possessives

The neuron fires on emotionally or spiritually charged content words: nouns and verbs that signal inner feeling or positive affect (e.g. smile, soul, power, pleasure).

Configuration

Prompts (Dashboard)

392,802 prompts, 256 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Negative Logits

Token (quoted to show leading spaces)    Value
"same"          0.65
"ständig"       0.62
"wt"            0.58
" Champaign"    0.58
"avec"          0.58
"include"       0.57
" رم"           0.57
" avec"         0.56
"相同"          0.56
" sauf"         0.56

Positive Logits

Token (quoted to show leading spaces)    Value
"每个人"         0.79
" intestines"   0.72
" insanların"   0.71
"每一個"         0.70
" oamen"        0.69
" människor"    0.69
"眸"            0.67
"每一个"         0.67
"人们"           0.66
"每个"           0.64

Activations Density 0.200%
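
To put the density figure in context, here is a minimal sketch of the arithmetic, using the prompt count and token length listed under Configuration. It assumes that "Activations Density" means the fraction of tokens on which the neuron has a nonzero activation, and that the density was measured over this same prompt set; the page itself confirms neither. The function names are illustrative only.

```python
# Figures copied from the dashboard; the interpretation is an assumption.
NUM_PROMPTS = 392_802
TOKENS_PER_PROMPT = 256
ACTIVATION_DENSITY = 0.00200  # 0.200%, assumed to be a fraction of all tokens


def total_tokens(num_prompts: int, tokens_per_prompt: int) -> int:
    """Total number of token positions across all prompts."""
    return num_prompts * tokens_per_prompt


def expected_active_tokens(total: int, density: float) -> float:
    """Approximate number of tokens with nonzero activation (assumed meaning)."""
    return total * density


total = total_tokens(NUM_PROMPTS, TOKENS_PER_PROMPT)
active = expected_active_tokens(total, ACTIVATION_DENSITY)

print(f"{total:,} tokens in total")
print(f"about {active:,.0f} tokens with nonzero activation")
```

Under these assumptions the neuron would be active on roughly one token in every 500, which fits a sparse, fairly specific feature.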