Explanations
references to notable authors and their works
Negative Logits
lorette         -0.62
'{}             -0.56
ouret           -0.56
extr            -0.54
endung          -0.54
Buxton          -0.53
utafitiHapana   -0.53
eado            -0.53
EXTR            -0.53
Baumann         -0.52
Positive Logits
"]";            0.55
kegaard         0.54
himself         0.52
UserScript      0.51
Vergine         0.51
esque           0.50
Italijani       0.50
__':            0.49
BeginInit       0.49
__':            0.49
Activations Density 0.280%