INDEX
    Explanations

    events following an action

    New Auto-Interp
    Negative Logits
     that
    -1.66
     any
    -1.27
     has
    -1.26
     will
    -1.23
     might
    -1.23
     had
    -1.20
     all
    -1.16
     such
    -1.13
     would
    -1.12
     have
    -1.08
    POSITIVE LOGITS
     being
    1.93
     deem
    1.39
     becoming
    1.34
     confess
    1.30
    being
    1.23
     orchestr
    1.21
     unsuccessfully
    1.18
     ſte
    1.17
     controversi
    1.16
     étant
    1.16
    Act Density 0.062%

    No Known Activations