INDEX
Explanations
references to authors and their works or contributions
New Auto-Interp
Negative Logits
zv
-0.16
iggins
-0.14
tvrt
-0.14
ZF
-0.14
TRGL
-0.14
adian
-0.13
racat
-0.13
éf
-0.13
xF
-0.13
Zot
-0.13
POSITIVE LOGITS
Jae
0.38
Hy
0.36
Jung
0.36
Sung
0.36
Woo
0.36
Ky
0.34
Ye
0.34
Jong
0.34
Won
0.34
Young
0.33
Activations Density 0.043%