INDEX
Explanations
numerical comparisons indicating quantity
New Auto-Interp
Negative Logits
inventoryQuantity
-0.63
behavi
-0.56
seriousness
-0.56
quickShipAvailable
-0.55
âĸ¬
-0.54
condem
-0.52
mination
-0.52
Clicker
-0.52
trave
-0.52
destro
-0.52
POSITIVE LOGITS
850
1.12
670
1.06
900
1.06
800
1.06
700
1.05
550
1.04
620
1.03
450
1.02
600
1.02
650
1.01
Activations Density 0.106%