INDEX
Explanations
percentages and numerical values
New Auto-Interp
Negative Logits
Rav
-0.71
kefeller
-0.64
Attributes
-0.63
flies
-0.62
ITED
-0.62
iden
-0.60
pload
-0.57
Merit
-0.57
ãĤ©
-0.56
hran
-0.56
POSITIVE LOGITS
ABV
0.93
xual
0.88
imet
0.80
iles
0.79
+.
0.79
+)
0.78
utilization
0.77
imeter
0.75
humidity
0.73
uary
0.70
Activations Density 0.829%