INDEX
    Explanations

    phrases related to driving forces or motivations

    phrases that indicate causation or motivation

    New Auto-Interp
    Negative Logits
    enegger
    -0.71
     vantage
    -0.66
    antage
    -0.64
     pads
    -0.64
     Kinnikuman
    -0.64
    akings
    -0.64
    eport
    -0.63
     Canad
    -0.63
    umbn
    -0.62
    alon
    -0.61
    POSITIVE LOGITS
     by
    0.95
     BY
    0.90
    by
    0.88
    Ń·
    0.81
     solely
    0.80
     principally
    0.77
    By
    0.76
     aback
    0.75
     By
    0.74
     destro
    0.73
    Act Density 0.154%

    No Known Activations