INDEX
Explanations
sentences transitioning to a new topic or section in a text
box office revenue figures of movies
Negative Logits
..."     -0.30
!".      -0.27
)."      -0.26
."       -0.26
[...]    -0.25
]."      -0.23
".       -0.23
".[      -0.23
?".      -0.23
.).      -0.23
Positive Logits
esides   0.21
atform   0.21
ivot     0.20
roups    0.19
olit     0.18
raltar   0.17
onents   0.17
ouple    0.17
rina     0.17
argo     0.16
Activations Density: 6.435%