INDEX
Explanations
The neuron is specialized to spot occurrences of the “max” operation (e.g. the token “max”) in code.
New Auto-Interp
Negative Logits
ウ
-0.07
timetable
-0.06
разом
-0.06
Ult
-0.06
Paleo
-0.06
。今
-0.06
Tele
-0.06
تقویت
-0.06
ời
-0.06
polit
-0.06
POSITIVE LOGITS
Aggregate
0.07
sponsored
0.07
aggregate
0.07
(Vector
0.07
ponsored
0.06
_maps
0.06
_NODES
0.06
predicates
0.06
potentials
0.06
mains
0.06
Activations Density 0.005%