Explanations

faithful, faithfully, faithfulness

The neuron detects occurrences of the word “faithful” (and its variant “faithfulness”).

Configuration

Prompts (Dashboard): 24,576 prompts, 128 tokens each
Dataset (Dashboard): monology/pile-uncopyrighted

Negative Logits

Token	Logit
↵↵	-3.58
徧	-2.38
 immaculate	-2.23
 beliebt	-2.22
笵	-2.19
 variadas	-2.16
 trabajadores	-2.14
colgante	-2.14
 wundersch	-2.13
ocidade	-2.13

Positive Logits

Token	Logit
you	3.81
	3.36
at	2.92
man	2.91
its	2.61
Ⲉ	2.50
most	2.47
“	2.45
</b>	2.44
get	2.44

Activations Density 0.004%
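
As a rough sanity check on these figures, the sketch below multiplies out the prompt configuration and applies the reported density. It assumes, without confirmation from the dashboard, that the 0.004% density was measured over the same 24,576 prompts of 128 tokens each. The function names are illustrative only.

```python
# Figures copied from the dashboard text.
NUM_PROMPTS = 24_576
TOKENS_PER_PROMPT = 128
ACTIVATION_DENSITY_PERCENT = 0.004


def total_tokens(num_prompts: int, tokens_per_prompt: int) -> int:
    """Total number of token positions covered by the prompts."""
    return num_prompts * tokens_per_prompt


def expected_active_tokens(total: int, density_percent: float) -> float:
    """Approximate count of token positions where the neuron fires.

    Assumption: density is the fraction of token positions with a
    non-zero activation, expressed as a percentage.
    """
    return total * density_percent / 100.0


total = total_tokens(NUM_PROMPTS, TOKENS_PER_PROMPT)
active = expected_active_tokens(total, ACTIVATION_DENSITY_PERCENT)

print(f"total token positions: {total:,}")
print(f"approx. active positions: {active:.0f}")
```

Under that assumption the neuron fires on roughly a hundred or so positions out of about three million, which would fit a detector for a single, fairly uncommon word family.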