INDEX
Explanations
This neuron activates on words and phrases relating to taking care of or maintaining something (e.g., “care,” “maintenance,” “cultivating,” “requiring basic care”).
New Auto-Interp
Negative Logits
ecal
-0.07
олот
-0.07
ordo
-0.07
legitim
-0.06
#=
-0.06
ंब
-0.06
evangel
-0.06
')">
-0.06
trail
-0.06
yine
-0.06
POSITIVE LOGITS
Peng
0.06
ống
0.06
AutoSize
0.06
isOpen
0.06
ومات
0.06
0.06
کز
0.06
Ав
0.06
.Dropout
0.06
<Member
0.06
Activations Density 0.023%