INDEX
    Explanations

    phrases related to contributions and experiences of individuals

    New Auto-Interp
    Negative Logits
    oug
    -0.19
    ãĥ¥
    -0.17
    elib
    -0.17
    ampions
    -0.15
    Periph
    -0.14
    achuset
    -0.14
    olean
    -0.14
    icker
    -0.14
    oser
    -0.14
    ijk
    -0.14
    POSITIVE LOGITS
     contribution
    0.24
     contributions
    0.23
     Contributions
    0.20
     Contribution
    0.19
    etz
    0.18
     bringing
    0.18
     contribute
    0.18
    bringing
    0.18
    帶
    0.17
    带
    0.16
    Act Density 0.076%

    No Known Activations