INDEX
Explanations
references to pendulums and related concepts
New Auto-Interp
Negative Logits
smith
-0.19
utherford
-0.17
herits
-0.17
ë§¥
-0.16
iefs
-0.15
altung
-0.14
寸
-0.14
éĨ
-0.14
er
-0.14
/***/
-0.14
POSITIVE LOGITS
ulum
0.30
leton
0.28
pend
0.24
Pend
0.22
ennis
0.21
ulous
0.19
pend
0.18
PEND
0.18
ragon
0.18
Ore
0.16
Activations Density 0.007%