INDEX
    Explanations

    phrases related to contributions and significant impacts

    Negative Logits
    ismet             -0.14
    ries              -0.14
    edom              -0.14
    inaire            -0.13
    kees              -0.13
    rahim             -0.13
    ãģ¡ãĤī            -0.13
    лÑĸÑĤ             -0.13
    omp               -0.13
    eref              -0.13

    Positive Logits
     contribution     1.13
     contributions    1.10
    contrib           0.94
     Contribution     0.94
     Contributions    0.90
    Contrib           0.89
     contrib          0.87
     contribute       0.84
     contributed      0.84
     contributing     0.81
    Activation Density: 0.180%

    No Known Activations