INDEX
Explanations
phrases that indicate a transition or change in state or time
New Auto-Interp
Negative Logits
zwar
-0.18
ossier
-0.17
.appspot
-0.16
heimer
-0.15
olini
-0.14
imary
-0.14
.jboss
-0.14
ãģ«ãĤĪ
-0.14
amaz
-0.14
/pg
-0.13
POSITIVE LOGITS
ultimately
0.25
being
0.25
Ultimately
0.20
eventually
0.20
finally
0.19
being
0.18
Being
0.17
Being
0.17
becoming
0.17
then
0.17
Activations Density 0.044%