INDEX
Explanations
This neuron never activates and thus does not respond to any particular text pattern.
New Auto-Interp
Negative Logits
edo
-0.07
)));↵↵
-0.07
—an
-0.07
Bağ
-0.07
seizure
-0.07
ния
-0.07
Christmas
-0.06
ไฟ
-0.06
êu
-0.06
кі
-0.06
POSITIVE LOGITS
respectfully
0.07
Suche
0.06
igidBody
0.06
update
0.06
Muhammad
0.06
esteem
0.06
apologies
0.06
Pick
0.06
Cue
0.06
StatusBar
0.06
Activations Density 0.013%