Explanations
This neuron fires on words and phrases signaling quickness or immediacy, such as “instant” and “almost instant,” along with similar expressions of speed.
Negative Logits
applaud  -0.07
291  -0.06
conf  -0.06
Nissan  -0.06
creed  -0.06
hostage  -0.06
ע  -0.06
trad  -0.06
.WEST  -0.06
trans  -0.06
Positive Logits
translate  0.06
Collections  0.06
)]; ↵  0.06
prung  0.06
Module  0.06
################################################################  0.06
etkisi  0.06
moderation  0.06
něj  0.06
.setFocus  0.06
Activations Density: 0.138%