INDEX
Explanations
manipulate
The neuron peaks on words and stems related to manipulation or persuasive influence (e.g. “manipulate,” “manipulation,” “propaganda”).
New Auto-Interp
Negative Logits
夏
-0.07
Tea
-0.06
outset
-0.06
luck
-0.06
_clock
-0.06
Oprah
-0.06
Datetime
-0.06
sol
-0.06
источ
-0.06
itespace
-0.06
POSITIVE LOGITS
Methods
0.06
CGI
0.06
infiltr
0.06
reserve
0.06
PageSize
0.06
((&
0.06
_between
0.06
Fuller
0.06
mpeg
0.06
여자
0.06
Activations Density 0.014%