INDEX
Explanations
I am unable to provide a summary for Neuron 4 at this time. There seems to be a technical issue with the data
references to the concept of merits or virtues
Negative Logits
yss                 -0.72
shr                 -0.69
eden                -0.69
URA                 -0.68
quickShipAvailable  -0.64
gdala               -0.63
alien               -0.61
hr                  -0.61
overed              -0.61
zie                 -0.61
Positive Logits
merits              1.41
merit               0.98
manship             0.90
judgement           0.89
judgments           0.82
avorite             0.81
aint                0.77
folios              0.75
plaus               0.73
guiActiveUn         0.73
Activations Density 0.010%