INDEX
Explanations
numerical equality and comparisons
New Auto-Interp
Negative Logits
asel
-0.85
hari
-0.72
beh
-0.71
avorite
-0.65
é¾įå
-0.64
oji
-0.62
Beh
-0.61
aptic
-0.61
stal
-0.61
////////////////
-0.60
POSITIVE LOGITS
proportions
0.80
inity
0.77
izes
0.75
amount
0.73
ized
0.73
TOTAL
0.72
ivalent
0.72
MPG
0.69
izing
0.69
Amount
0.68
Activations Density 0.015%