INDEX
Explanations
The neuron is triggered by mentions of “Drake,” i.e. the rapper’s name.
New Auto-Interp
Negative Logits
Curl
-0.07
bucket
-0.06
changing
-0.06
TT
-0.06
stup
-0.06
Jo
-0.06
Ship
-0.06
ordinary
-0.06
відом
-0.06
全部
-0.06
POSITIVE LOGITS
Drake
0.10
Wolverine
0.09
Deadpool
0.07
связи
0.07
leveraging
0.07
velice
0.06
.stride
0.06
.addRow
0.06
TRACE
0.06
SupportActionBar
0.06
Activations Density 0.001%