Explanations
quantitative measures and comparisons
Negative Logits
etty        -0.15
heel        -0.14
edelta      -0.14
ilon        -0.14
fewer       -0.14
Few         -0.13
HEEL        -0.13
Feinstein   -0.13
uali        -0.13
_HS         -0.13
Positive Logits
fold        0.48
times       0.47
-fold       0.45
times       0.44
TIMES       0.43
åĢį (倍)    0.42
fold        0.41
-times      0.40
folds       0.40
Fold        0.39
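
The `åĢį` entry in the positive logits list looks like the byte-level BPE rendering of a multi-byte UTF-8 character rather than real text. The sketch below assumes a GPT-2-style byte-to-character table (an assumption; the page does not name the tokenizer) and maps the token back to raw bytes. The helper names `bytes_to_unicode` and `decode_token` are illustrative, not part of any dashboard API.

```python
def bytes_to_unicode():
    """Build a GPT-2-style map from byte value to a printable stand-in character."""
    # Bytes that are already printable keep their own code point.
    printable = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(0xA1, 0xAC + 1))
        + list(range(0xAE, 0xFF + 1))
    )
    byte_values = printable[:]
    code_points = printable[:]
    shift = 0
    # Every remaining byte is shifted to 256 + n, in increasing byte order.
    for b in range(256):
        if b not in printable:
            byte_values.append(b)
            code_points.append(256 + shift)
            shift += 1
    return {b: chr(c) for b, c in zip(byte_values, code_points)}


def decode_token(token):
    """Turn a displayed byte-level token back into the text it encodes."""
    char_to_byte = {c: b for b, c in bytes_to_unicode().items()}
    raw = bytes(char_to_byte[ch] for ch in token)
    return raw.decode("utf-8", errors="replace")


# The token from the list, written with escapes to avoid copy-paste damage.
decoded = decode_token("\u00e5\u0122\u012f")
print(decoded)
```

Under that assumption the token decodes to 倍, the Chinese multiplier word ("times", "-fold"), which fits the other top tokens in the list.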
Activations Density 0.080%