INDEX
Explanations
references to pendulums
New Auto-Interp
Negative Logits
unger
-0.16
utherford
-0.16
orns
-0.15
.Designer
-0.15
enha
-0.15
寸
-0.15
èŀº
-0.15
TEL
-0.14
Pie
-0.14
GURL
-0.14
POSITIVE LOGITS
pend
0.26
ulum
0.25
Pend
0.22
leton
0.21
suspended
0.18
pend
0.17
eton
0.17
PEND
0.17
lop
0.16
thon
0.16
Activations Density 0.011%