INDEX
    Explanations

    references to gratitude and support

    Negative Logits
    oli             -0.16
    ubbo            -0.15
    सन              -0.14
    ç·Ĵ             -0.14
    ordo            -0.14
    ordin           -0.14
     erle           -0.14
    íĹ              -0.14
     affairs        -0.14
    ordion          -0.14
    Positive Logits
     efforts         0.26
     effort          0.24
     contribution    0.23
     contributions   0.21
     help            0.20
     continued       0.18
     work            0.18
     trouble         0.18
    oyal             0.17
     handling        0.17
    Activation Density: 0.058%

    No Known Activations