INDEX

Explanations

version numbers and email addresses

The neuron strongly activates on multi‐part numeric strings (especially version‐style numbers with digits separated by dots).

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

Negative Logits

 spolu

-0.87

vého

-0.80

istered

-0.79

kru

-0.78

あの

-0.77

reon

-0.75

minar

-0.75

 Το

-0.75

bitol

-0.75

ELE

-0.73

POSITIVE LOGITS

empres

0.91

 השאלה

0.90

SÍ

0.88

 kadang

0.88

zdjęcie

0.86

 bazen

0.85

Ajust

0.83

 maniere

0.81

番外

0.80



0.80

Activations Density 0.043%