Explanations

AI assistant refusal

This neuron primarily detects standalone numeric tokens.

Configuration

Prompts (Dashboard)

392,802 prompts, 256 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Negative Logits

‫  0.64
0.62
ma  0.61
 Fossil  0.59
 Pills  0.57
の  0.55
 Promenade  0.55
 fossil  0.54
 بھی  0.54
 Would  0.54

Positive Logits

Disclaimer  0.75
jekt  0.69
 первое  0.69
NOTA  0.66
NOTE  0.66
сі  0.66
explanatory  0.66
᱑  0.64
ankind  0.64
BEGIN  0.64

Activations Density 0.035%
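
As a rough sanity check, the prompt count, prompt length, and activation density reported on this page can be combined into an approximate number of activating tokens. The sketch below assumes the 0.035% density is the share of tokens with nonzero activation, measured over the same 392,802-prompt set; the page does not state this. The constant and function names are illustrative only.

```python
# Figures as reported on this page.
NUM_PROMPTS = 392_802
TOKENS_PER_PROMPT = 256
ACTIVATION_DENSITY_PERCENT = 0.035


def total_tokens(num_prompts: int, tokens_per_prompt: int) -> int:
    """Total tokens in the prompt set, assuming every prompt has full length."""
    return num_prompts * tokens_per_prompt


def estimated_active_tokens(token_count: int, density_percent: float) -> float:
    """Approximate number of tokens on which the neuron fires.

    Assumption: density is the percentage of tokens with nonzero activation,
    computed over the same token set passed in here.
    """
    return token_count * density_percent / 100.0


total = total_tokens(NUM_PROMPTS, TOKENS_PER_PROMPT)
active = estimated_active_tokens(total, ACTIVATION_DENSITY_PERCENT)

print(f"total tokens: {total:,}")
print(f"estimated activating tokens: {active:,.0f}")
```

Under that assumption, the prompt set holds 100,557,312 tokens and the neuron fires on roughly 35,000 of them, which is consistent with a sparse feature.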