Explanations

distributed under the license

The neuron strongly activates on numeric tokens (multi‐digit numbers) in the text.

Configuration

Prompts (Dashboard): 24,576 prompts, 128 tokens each

Dataset (Dashboard): monology/pile-uncopyrighted

Negative Logits

Token       Logit
"падает"    -0.79
" Rhys"     -0.74
"承受"      -0.69
"invasive"  -0.67
"trak"      -0.67
"ԁ"         -0.66
"nagel"     -0.65
" lucha"    -0.65
"nsan"      -0.65
" Finds"    -0.65

Positive Logits

Token        Logit
" Nadine"    0.74
"コー"       0.68
" Ewig"      0.65
" otwar"     0.64
" ос"        0.64
" Isto"      0.63
"aduras"     0.62
"Continued"  0.62
"ولد"        0.61
"JQ"         0.61
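
This page does not say how the logit lists are produced. A common convention on dashboards of this kind is to project the neuron's output direction through the model's unembedding matrix and report the tokens with the largest and smallest resulting values. The sketch below illustrates that convention only, on toy data; the function name `top_logit_tokens`, the arrays `w_out` and `W_U`, and the four-token vocabulary are all hypothetical and are not taken from this dashboard.

```python
import numpy as np

def top_logit_tokens(w_out, W_U, vocab, k=10):
    """Return the k most promoted and k most suppressed tokens.

    Assumption: the logit effect of a neuron is its output direction
    (shape [d_model]) multiplied by the unembedding matrix
    (shape [d_model, vocab_size]).
    """
    effects = w_out @ W_U                # one value per vocabulary token
    order = np.argsort(effects)          # ascending: most negative first
    negative = [(vocab[i], float(effects[i])) for i in order[:k]]
    positive = [(vocab[i], float(effects[i])) for i in order[::-1][:k]]
    return positive, negative

# Hypothetical toy model: 2-dimensional residual stream, 4-token vocabulary.
w_out = np.array([1.0, 0.0])
W_U = np.array([[0.5, -0.8, 0.1, 0.7],
                [0.0,  0.0, 0.0, 0.0]])
vocab = ["a", "b", "c", "d"]

positive, negative = top_logit_tokens(w_out, W_U, vocab, k=2)
print("positive:", positive)
print("negative:", negative)
```

Under this reading, the tokens listed above would be the ones the neuron most promotes or suppresses when it fires, which is separate from the input tokens it activates on.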

Activations Density 0.077%
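
To put the density figure in context, here is a minimal sketch that assumes "activations density" means the fraction of token positions on which the neuron's activation is nonzero (the page does not define it). The helper `activation_density` and the toy array are hypothetical; only the prompt count, tokens per prompt, and the 0.077% figure come from this page.

```python
import numpy as np

def activation_density(activations, threshold=0.0):
    """Fraction of token positions whose activation exceeds the threshold.

    Assumption: density counts positions with activation > threshold,
    divided by the total number of positions.
    """
    return float((activations > threshold).mean())

# Hypothetical toy activations: 4 prompts x 8 tokens, active on 2 positions.
acts = np.zeros((4, 8))
acts[0, 3] = 1.5
acts[2, 5] = 0.2
toy_density = activation_density(acts)   # 2 / 32

# Scale implied by the dashboard configuration, under the same assumption.
total_tokens = 24_576 * 128              # prompts x tokens per prompt
approx_active = round(total_tokens * 0.077 / 100)

print(toy_density, total_tokens, approx_active)
```

Under that assumption, 0.077% of the 3,145,728 sampled tokens corresponds to roughly 2,400 active positions.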