INDEX
Explanations
polls and voting-related actions.
The neuron detects mentions of voting or polls—i.e. words used to invite or report user votes (e.g. “vote,” “poll,” “voting”).
New Auto-Interp
Negative Logits
prompts
-0.07
Fak
-0.07
dream
-0.07
pruning
-0.06
�
-0.06
ак
-0.06
多少
-0.06
McC
-0.06
()).
-0.06
orc
-0.06
POSITIVE LOGITS
crease
0.07
licant
0.07
courte
0.06
ισμού
0.06
Vote
0.06
إد
0.06
бла
0.06
муницип
0.06
arine
0.06
.hamcrest
0.06
Activations Density 0.012%