INDEX
Explanations
references to uniformity and standardization in various contexts
New Auto-Interp
Negative Logits
nt
-0.17
ors
-0.17
tin
-0.16
ive
-0.15
rian
-0.15
leigh
-0.15
breaking
-0.14
.CONFIG
-0.14
utenberg
-0.14
Fuller
-0.14
POSITIVE LOGITS
ities
0.21
ity
0.18
estead
0.17
eting
0.17
ication
0.16
bread
0.16
estar
0.16
è¡¡
0.16
iac
0.16
uito
0.16
Activations Density 0.052%