INDEX
    Explanations

    phrases related to actions and intentions of people

    connections between actions and consequences

    Negative Logits
    Token            Logit
    "rique"          -0.89
    "asio"           -0.77
    " fronts"        -0.76
    "rers"           -0.66
    "hoe"            -0.65
    "lance"          -0.62
    " reinstated"    -0.62
    "hei"            -0.61
    "onde"           -0.60
    " holders"       -0.60
    Positive Logits
    Token            Logit
    " namely"        0.92
    " viz"           0.78
    "excluding"      0.74
    " Whether"       0.73
    "Magikarp"       0.71
    " Including"     0.70
    " except"        0.67
    "whether"        0.65
    " INCLUD"        0.64
    "BUT"            0.64
    Activation Density: 0.760%

    No Known Activations