INDEX
    Explanations

    references to awards, recognition, or titles in various contexts

    New Auto-Interp
    Negative Logits
     Vác
    -0.14
    icari
    -0.14
    iple
    -0.14
    ABCDEFGHI
    -0.13
    exus
    -0.13
    rens
    -0.13
    اظ
    -0.13
    .getvalue
    -0.13
    ']="
    -0.12
    çĦ¼
    -0.12
    POSITIVE LOGITS
     give
    0.64
     giving
    0.60
     gave
    0.59
     given
    0.58
     Give
    0.56
    ç»Ļ
    0.55
    give
    0.54
     gives
    0.54
    Give
    0.53
    給
    0.52
    Act Density 0.553%

    No Known Activations