INDEX
Explanations
The neuron activates on words or word fragments that mean “selection” or “choosing.”
New Auto-Interp
Negative Logits
İmparator
-0.07
JPanel
-0.06
lbl
-0.06
com
-0.06
راق
-0.06
box
-0.06
.ReadFile
-0.06
розповід
-0.06
alert
-0.05
feito
-0.05
POSITIVE LOGITS
selecting
0.10
chose
0.09
selected
0.09
Choosing
0.09
choose
0.09
choosing
0.09
Picks
0.08
picks
0.07
Choosing
0.07
pick
0.07
Activations Density 0.040%