INDEX

Explanations

reduce risk and problems

The neuron highlights words naming harmful or undesirable conditions or risks (e.g. bleeding, spam, thrombogenicity, risk).

Configuration

Prompts (Dashboard): 24,576 prompts, 128 tokens each

Dataset (Dashboard): monology/pile-uncopyrighted

Negative Logits

Token           Logit
"OrUpdate"      -0.98
"tarjetas"      -0.93
"apertura"      -0.91
"⤵"             -0.83
"isAuth"        -0.82
"Durée"         -0.82
" quivering"    -0.81
" permettra"    -0.80
"损"            -0.79
"𔘓"            -0.79

Positive Logits

Token           Logit
" Erzä"         0.91
" avoid"        0.89
"avoid"         0.88
"ہ"             0.84
" избе"         0.83
"ENDS"          0.80
"dhury"         0.79
"ktes"          0.79
"kait"          0.79
" superfí"      0.78
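
As a rough illustration of how token/logit lists like the ones above are commonly produced, the sketch below projects a feature's output direction through an unembedding matrix and ranks the vocabulary by the resulting score. This is an assumption about the method, not something the page states; the function name `top_logit_tokens`, the toy vocabulary, and all numbers are hypothetical.

```python
def top_logit_tokens(direction, unembed, vocab, k=3):
    """Return (most_negative, most_positive) lists of (token, score) pairs."""
    # score[j] = sum_i direction[i] * unembed[i][j]
    scores = [
        sum(d * row[j] for d, row in zip(direction, unembed))
        for j in range(len(vocab))
    ]
    # Sort ascending by score: the front holds the most negative tokens.
    ranked = sorted(zip(vocab, scores), key=lambda pair: pair[1])
    return ranked[:k], ranked[::-1][:k]


# Toy data, entirely made up for illustration (2 hidden dims, 4 tokens).
vocab = [" avoid", "avoid", " quivering", "isAuth"]
unembed = [
    [1.0, 0.9, -0.5, -0.2],
    [0.5, 0.4, -0.8, -1.0],
]
direction = [1.0, 0.5]

negative, positive = top_logit_tokens(direction, unembed, vocab, k=2)
print("negative:", negative)
print("positive:", positive)
```

Tokens with a leading space (such as " avoid") are distinct vocabulary entries from their unspaced forms, which is why both can appear in the same list.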

Activations Density 0.105%
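
To put the density figure in context, the arithmetic below estimates how many tokens it corresponds to. It assumes (this is not stated on the page) that the density is the fraction of tokens with a nonzero activation and that it was measured over the same 24,576 prompts of 128 tokens listed under Configuration.

```python
# Values taken from the Configuration and Activations Density lines.
prompts = 24_576
tokens_per_prompt = 128
density = 0.105 / 100  # 0.105% expressed as a fraction

# Assumption: density = active tokens / total tokens in the dashboard set.
total_tokens = prompts * tokens_per_prompt
active_tokens = round(total_tokens * density)

print(f"total tokens:  {total_tokens:,}")
print(f"active tokens: {active_tokens:,} (approximate)")
```

Under that assumption the feature fires on roughly 3,300 of about 3.1 million tokens, or about one token in every 950.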