INDEX

Explanations

benefit risk cost

The neuron activates on numeric and quantitative terms—numbers, percentages, and metric words like “risk,” “cost,” “benefit,” and “ratio.”

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

Negative Logits

気が

-0.88

othermal

-0.84

₁(

-0.82

 geladen

-0.80

tructuring

-0.80

prehensive

-0.79

umumkan

-0.78

truct

-0.77

ELAND

-0.75

 siè

-0.75

POSITIVE LOGITS

 benefits

3.94

 benefit

3.84

 Benefits

3.17

 Benefit

3.09

Benefit

3.08

benefit

3.03

benefits

2.95

Benefits

2.81

 risk

2.56

 BENEFITS

2.30

Activations Density 0.066%