INDEX

Explanations

things that are not right

The neuron specifically detects occurrences of the word “undue” (marking something excessive or unwarranted).

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

Negative Logits

 From

-2.20

 Another

-2.14

 usai

-2.11

蠵

-2.06

 Then

-2.03

 caneca

-2.00

 passaggio

-1.99

 Three

-1.95

土曜

-1.94

蔼

-1.94

POSITIVE LOGITS

2.16

の姿

2.02

1.93

jenige

1.89

 это

1.89

啬

1.88

ка

1.81

↵↵

1.80

 possam

1.77

amp

1.75

Activations Density 0.002%