INDEX
Explanations
numerical values related to performance metrics or statistics
New Auto-Interp
Negative Logits
ounter
-0.17
pes
-0.15
xis
-0.14
olvers
-0.14
ursal
-0.14
kus
-0.14
lett
-0.14
ustr
-0.14
cles
-0.13
ÑĤен
-0.13
POSITIVE LOGITS
IID
0.18
eniz
0.16
ãĥ¼ãĥ©
0.16
ίÏĦ
0.15
ipa
0.15
anga
0.14
vox
0.14
Slave
0.14
stream
0.14
DISCLAIM
0.14
Activations Density 0.017%