INDEX

Explanations

awe, marvel, wonder, admiration

The neuron responds to words that express strong emotional reactions—especially awe, wonder, fear, marvel, admiration, or astonishment.

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

Negative Logits

 sweetest

-0.80

一听

-0.80

บาย

-0.77

bpm

-0.75

 substitution

-0.74

Epit

-0.74

canakan

-0.73

 temperature

-0.71

ActivityResult

-0.70

yı

-0.70

POSITIVE LOGITS

awe

3.28

 marvel

3.03

 amazement

2.80

 wonder

2.77

 amazed

2.75

awe

2.45

 admiration

2.17

 marvels

2.06

marvel

1.98

 fascinated

1.98

Activations Density 0.045%