INDEX
Explanations
This neuron activates on occurrences of the “private” access modifier in code.
Negative Logits
plot
-0.07
users
-0.06
crowds
-0.06
students
-0.06
odds
-0.06
.tem
-0.06
lease
-0.06
Verizon
-0.06
.pad
-0.06
버전
-0.06
Positive Logits
){↵↵
0.06
ch
0.06
ρχ
0.06
ancements
0.06
ewood
0.06
ouchers
0.06
-An
0.06
ourced
0.06
ponder
0.06
示例
0.06
Activations Density 0.002%