INDEX
    Explanations

    references to apologies and announcements related to personal or professional activities

    New Auto-Interp
    Negative Logits
    avra
    -0.16
    iyan
    -0.15
     addCriterion
    -0.14
    ÂŃi
    -0.14
    ÏĦÏħ
    -0.14
     à¹Ĩ
    -0.14
    .failure
    -0.14
    challenge
    -0.13
    ymb
    -0.13
    ISMATCH
    -0.13
    POSITIVE LOGITS
     fans
    0.26
     Fans
    0.25
     fan
    0.23
    Fans
    0.22
    fans
    0.22
     Fan
    0.19
     group
    0.19
     idols
    0.19
    fan
    0.19
     fandom
    0.18
    Act Density 0.001%

    No Known Activations