INDEX

Explanations

conditions met

This neuron detects the instructional phrase “if applicable,” flagging when a conditional, optional instruction is introduced.

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

Negative Logits

if

-1.27

 fár

-1.15

 chaleco

-1.13

 that

-1.08

 surtido

-1.05

いざ

-1.05

 bocetos

-1.03



-1.03

 wenn

-0.99

 דער

-0.99

POSITIVE LOGITS

or

1.16

 yaitu

0.96

itola

0.92

 según

0.92

 додат

0.91

 конечно

0.88

Most

0.87

 Throughout

0.86

 While

0.86

 provide

0.85

Activations Density 0.046%