INDEX
Explanations
The neuron detects mentions of specific clothing items or garments in the text.
New Auto-Interp
Negative Logits
cunning
-0.06
梁
-0.06
/class
-0.06
řej
-0.06
лит
-0.06
몰
-0.06
hảo
-0.06
referrals
-0.06
机会
-0.06
Guid
-0.06
POSITIVE LOGITS
、
0.06
olik
0.06
tı
0.06
,又
0.06
Responses
0.06
.setItems
0.06
대의
0.06
ağ
0.06
feasibility
0.06
liğinin
0.06
Activations Density 0.071%