Explanations
The neuron activates on category lines specifying a cause of death (e.g. “Deaths from …”).
Negative Logits
positives    -0.07
Level        -0.07
IVING        -0.07
delta        -0.07
ッチ         -0.07
started      -0.06
sparkle      -0.06
simd         -0.06
Street       -0.06
Actions      -0.06
Positive Logits
보호         0.06
enclave      0.06
searchModel  0.06
béné         0.06
Dst          0.05
PressEvent   0.05
orestation   0.05
hay          0.05
площад       0.05
Coupon       0.05
Activations Density: 0.003%