INDEX
Explanations
Neuron 4 does not appear to activate on any of the provided tokens. It may respond to something absent from the provided text excerpts, or it may be inactive or malfunctioning.
Negative Logits
Token              Logit
latest             -0.72
ache               -0.72
awa                -0.68
dylib              -0.67
notations          -0.67
++)                -0.66
residues           -0.64
(no token text)    -0.64
soever             -0.63
Ĥİ                 -0.63
Positive Logits
Token      Logit
Watt       0.73
rouse      0.71
nect       0.68
uras       0.68
ner        0.67
Buk        0.62
aan        0.62
ners       0.62
slave      0.60
rament     0.59
Activation Density: 0.000%
No Known Activations
This feature has no known activations.
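
The page does not say how the activation density figure is computed. As a minimal sketch, assuming density means the fraction of sampled token positions where the neuron's activation exceeds a threshold (zero here), a neuron with no known activations would report 0.000%. The function names and the sample data below are hypothetical and only illustrate that assumption.

```python
from typing import Sequence


def activation_density(activations: Sequence[float], threshold: float = 0.0) -> float:
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: this definition of density is illustrative; the source page
    does not state the exact formula it uses.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


def is_dead(activations: Sequence[float], threshold: float = 0.0) -> bool:
    """A neuron is treated as dead here if it never exceeds the threshold."""
    return activation_density(activations, threshold) == 0.0


# Hypothetical sample: a neuron that never fires on eight token positions.
sample = [0.0] * 8
print(f"Activation density: {activation_density(sample):.3%}")
print("No known activations" if is_dead(sample) else "Has activations")
```

Under this assumption, a density of zero on a finite sample only shows that the neuron did not fire on the text that was checked. It does not rule out activations on inputs outside that sample, which matches the hedged wording of the explanation above.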