INDEX
    Explanations

    expressions related to gratitude and appreciation

    New Auto-Interp
    Negative Logits
    oine
    -0.15
    edio
    -0.15
    igue
    -0.15
    ventus
    -0.15
    ouz
    -0.14
    .rdf
    -0.14
     Bilim
    -0.14
    ione
    -0.14
    odate
    -0.14
    andbox
    -0.14
    POSITIVE LOGITS
    ola
    0.20
     Lagos
    0.19
     Ol
    0.19
    emi
    0.19
     Ade
    0.18
    iola
    0.18
    erin
    0.17
    ipe
    0.17
    ADED
    0.17
    OLA
    0.17
    Act Density 0.071%

    No Known Activations