INDEX
Explanations
This neuron finds special characters used to segregate text
references to computer programming or technical concepts
New Auto-Interp
Negative Logits
hement
-0.92
destro
-0.78
glers
-0.73
nodd
-0.66
ende
-0.66
NRS
-0.66
estab
-0.65
charism
-0.65
Niet
-0.63
stad
-0.61
POSITIVE LOGITS
bus
0.82
shall
0.81
DATA
0.80
bable
0.79
INC
0.79
evidence
0.77
20439
0.77
successful
0.77
FACE
0.76
heter
0.75
Activations Density 0.125%