INDEX
Explanations
Sections of a document that express logical reasoning or argumentation about decision-making processes.
The neuron detects words and phrases that convey rushing or hasty action (e.g., “rush,” “rushing,” “rushed”).
NEGATIVE LOGITS
Positive
-0.08
élé
-0.07
compte
-0.07
Decom
-0.07
resilient
-0.06
Find
-0.06
ivalent
-0.06
Prote
-0.06
comun
-0.06
protein
-0.06
POSITIVE LOGITS
rush
0.16
Rush
0.15
rushing
0.13
rushed
0.12
hurried
0.11
hurry
0.11
rushes
0.10
Hur
0.08
Hur
0.08
rush
0.08
Activations Density 0.005%