INDEX
Explanations
This neuron activates on phrases expressing per-unit rates or ratios (e.g. Spanish “entrada por cada …”, roughly “entry per …”).
Negative Logits
("<          -0.07
assist       -0.06
Summary      -0.06
seront       -0.06
ixed         -0.06
Airlines     -0.06
ebiliriz     -0.06
hair         -0.06
Pooling      -0.06
reducing     -0.06
Positive Logits
extravag     0.06
votes        0.06
entries      0.06
Rows         0.06
ดร           0.06
-ranging     0.06
ене          0.06
getClass     0.06
TableColumn  0.06
.CON         0.06
Activations Density 0.028%