INDEX
Explanations
percentages or proportions
references to percentages and proportions
New Auto-Interp
Negative Logits
worldly
-0.92
åĤ
-0.79
zie
-0.78
ipples
-0.73
nings
-0.72
Skies
-0.69
gest
-0.68
bro
-0.67
æ©
-0.67
andr
-0.67
POSITIVE LOGITS
Invisible
0.98
certainty
0.94
accuracy
0.92
accurate
0.91
humidity
0.86
completion
0.84
pure
0.79
chance
0.79
confidence
0.77
efficient
0.76
Activations Density 0.070%