INDEX
Explanations
This neuron detects references to online resources, specifically web links or URLs.
New Auto-Interp
Negative Logits
Oro
-0.07
py
-0.06
的时候
-0.06
perception
-0.06
ook
-0.06
601
-0.06
ości
-0.06
airport
-0.06
Leaf
-0.06
beach
-0.06
POSITIVE LOGITS
biting
0.07
Servers
0.07
Women
0.07
.bt
0.07
carved
0.06
sway
0.06
(true
0.06
Οκ
0.06
合格
0.06
Labor
0.06
Activations Density 0.014%