INDEX
Explanations
references to specific science fiction themes or titles
New Auto-Interp
Negative Logits
lea
-0.17
Į¨
-0.15
965
-0.15
furt
-0.14
erno
-0.14
lun
-0.14
ledged
-0.14
orte
-0.14
ulo
-0.14
numRows
-0.14
POSITIVE LOGITS
Sy
0.30
dney
0.29
sy
0.29
Sy
0.24
.sy
0.24
nergy
0.23
bil
0.21
nergie
0.21
ringe
0.21
posium
0.20
Activations Density 0.017%