Explanations
references to demographic and historical recordkeeping
Negative Logits
unary   -0.16
ليف     -0.15
asty    -0.15
olit    -0.14
etro    -0.14
riday   -0.13
rani    -0.13
Eigen   -0.13
зат     -0.13
util    -0.13
Positive Logits
micro   0.20
Micro   0.20
Micro   0.20
micro   0.19
_micro  0.18
Batch   0.18
Batch   0.17
GS      0.17
batch   0.17
Film    0.16
Activations Density 0.015%