INDEX
Explanations
references to beginnings and endings in narratives or events
New Auto-Interp
Negative Logits
FACT
-0.16
ories
-0.15
uts
-0.15
pedia
-0.15
èµ·æĿ¥
-0.15
ene
-0.14
entire
-0.14
fuse
-0.14
िà¤
-0.14
ouv
-0.14
POSITIVE LOGITS
stages
0.21
-middle
0.20
/end
0.18
mast
0.17
credits
0.16
/start
0.15
ìĭ¬
0.15
WebResponse
0.15
aviest
0.15
inning
0.15
Activations Density 0.054%