Explanations

Lie-starting words and names

The neuron activates on occurrences of the letter sequence “Lie” (as in names or the word “Lie”).
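
As a rough illustration of the explanation above, the string-level pattern it describes can be sketched with a regular expression. This is only a text proxy under stated assumptions, not the feature itself: the helper name `find_lie_words` and the sample sentence are hypothetical, and the real feature operates on model tokens rather than on raw characters.

```python
import re

# Assumption: "Lie-starting" means a word that begins with the
# case-sensitive letters "Lie" (e.g. "Lie", "Lieutenants", "Lieber").
LIE_PATTERN = re.compile(r"\bLie\w*")

def find_lie_words(text):
    """Return every word in `text` that starts with 'Lie' (hypothetical helper)."""
    return [m.group(0) for m in LIE_PATTERN.finditer(text)]

sample = "Lieutenants met Dr. Lieber; the lie detector was off."
print(find_lie_words(sample))  # lowercase "lie" is intentionally not matched
```

The match is case-sensitive on purpose, since the explanation names the capitalized sequence “Lie”; whether the feature also responds to lowercase "lie" is not stated on this page.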

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Negative Logits

Token              Logit
" bonbons"         -0.83
"stainless"        -0.82
"breakfast"        -0.81
" getCategory"     -0.81
"CONSULT"          -0.81
" تقديم"           -0.78
"Bim"              -0.77
" zoll"            -0.77
" Insufficient"    -0.77
"teenage"          -0.77

Positive Logits

Token              Logit
" detector"        1.16
"Lie"              1.12
"utenants"         1.02
"Lie"              0.97
"Detector"         0.91
"haber"            0.87
" Lipschitz"       0.86
" detection"       0.83
"cama"             0.82
" fallow"          0.82

Activations Density 0.013%
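
To give the density figure a sense of scale, the short calculation below converts it into an approximate token count. It assumes, and the page does not confirm this, that the density is the fraction of tokens with a nonzero activation over the dashboard prompt set listed under Configuration (24,576 prompts of 128 tokens each).

```python
# Values taken from this page.
N_PROMPTS = 24_576
TOKENS_PER_PROMPT = 128
DENSITY_PERCENT = 0.013

# Assumption: density = active tokens / total tokens in the dashboard prompts.
total_tokens = N_PROMPTS * TOKENS_PER_PROMPT
approx_active = total_tokens * DENSITY_PERCENT / 100

print(total_tokens)          # 3145728
print(round(approx_active))  # 409
```

Under that assumption, the feature fires on roughly 400 of about 3.1 million tokens, which is consistent with a narrow, string-specific explanation.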