INDEX

Explanations

The neuron strongly activates on words that signal user-focused product capabilities or instructions—e.g. second-person pronouns and modals (“you,” “can,” “allows,” “may,” “also,” etc.)—marking up technical or promotional descriptions of features.

New Auto-Interp

Configuration

Prompts (Dashboard)

392,802 prompts, 256 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

例如

0.93

它

0.89

वादियों

0.88

ਗ

0.86

ਟ

0.84

稻

0.82

美國

0.82

ceding

0.81

insuku

0.81

达

0.80

POSITIVE LOGITS

 Bucure

1.55

 aproape

1.54

 pentru

1.52

 Pentru

1.48

 numai

1.45

 Cluj

1.44

 foarte

1.42

 România

1.41

 astfel

1.39

Pentru

1.34

Activations Density 0.062%