
Explanations

preventing or inhibiting actions

This neuron fires on the negation “won’t” (specifically on the “t” in “won’t”) and on the verbs that follow it, as in “won’t give up” or “won’t shut its doors”.


Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted


Negative Logits

Token             Logit
 rhino            -0.98
सि                -0.86
Approve           -0.85
 établissements   -0.82
 reinstatement    -0.82
݂                 -0.81
 fave             -0.81
が多              -0.80
 apparition       -0.79
 vilka            -0.79

Positive Logits

Token             Logit
 restrictions     1.09
 inhibitors       1.01
 inhibitor        0.84
＾                0.84
 with             0.84
 brakes           0.80
 inhibition       0.80
 inhibitory       0.79
̑                 0.77
 deleteById       0.77

Activations Density 0.072%
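
To put the density figure in context, the sketch below estimates how many tokens it corresponds to. It assumes (the page does not say so) that activation density is the fraction of tokens with a non-zero activation, measured over the dashboard prompts listed under Configuration. The function and constant names are illustrative only.

```python
# Values copied from the Configuration and Activations Density lines.
NUM_PROMPTS = 24_576
TOKENS_PER_PROMPT = 128
ACTIVATION_DENSITY_PERCENT = 0.072


def estimate_active_tokens(num_prompts, tokens_per_prompt, density_percent):
    """Return (total tokens, estimated activating tokens).

    Assumption: density is the share of all tokens on which the neuron
    has a non-zero activation.
    """
    total_tokens = num_prompts * tokens_per_prompt
    active_tokens = total_tokens * density_percent / 100.0
    return total_tokens, active_tokens


total_tokens, active_tokens = estimate_active_tokens(
    NUM_PROMPTS, TOKENS_PER_PROMPT, ACTIVATION_DENSITY_PERCENT
)
print(f"total tokens: {total_tokens:,}")                   # total tokens: 3,145,728
print(f"approx. activating tokens: {active_tokens:,.0f}")  # approx. activating tokens: 2,265
```

Under that assumption, the neuron is active on roughly 2,265 of about 3.1 million tokens, which fits a feature tied to one specific contraction.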