INDEX
    Explanations

    phrases or clauses that express causality

    references to reasons and justifications

    NEGATIVE LOGITS
    Token                      Logit
    "Enlarge"                  -0.51
    " UNITED"                  -0.46
    "BuyableInstoreAndOnline"  -0.45
    " Belfast"                 -0.42
    "iage"                     -0.42
    "士"                       -0.40
    " Colombian"               -0.40
    " laun"                    -0.40
    "ITIES"                    -0.40
    "interstitial"             -0.39
    POSITIVE LOGITS
    Token                      Logit
    "esides"                    0.49
    " luck"                     0.49
    " disclaim"                 0.46
    "versely"                   0.46
    " spite"                    0.45
    "elo"                       0.45
    "asty"                      0.44
    " guessed"                  0.43
    " paraph"                   0.43
    "alle"                      0.42
    Act Density 3.727%

    No Known Activations