INDEX
    Explanations

    phrases related to actions and capabilities

    phrases indicating conditional situations or consequences

    New Auto-Interp
    Negative Logits
    raft
    -0.77
    ¯
    -0.73
    mx
    -0.71
    ode
    -0.67
    neg
    -0.65
    owl
    -0.64
    dds
    -0.64
    uga
    -0.63
    sav
    -0.63
    MSN
    -0.62
    POSITIVE LOGITS
     accordingly
    1.04
     thereafter
    0.90
     alike
    0.72
     consequently
    0.70
     consequ
    0.67
     advoc
    0.66
     notor
    0.66
     reused
    0.66
     afterwards
    0.65
     thereof
    0.64
    Act Density 0.808%

    No Known Activations