INDEX
    Explanations

    verbs indicating personal expression or interaction among characters

    Negative Logits
     때문에
    -0.42
     yelek
    -0.39
    
    -0.32
     π
    -0.30
    pi
    -0.29
    -0.29
    点了点头
    -0.29
    点点头
    -0.29
     nedeniyle
    -0.29
     lowers
    -0.28
    Positive Logits
    GOTREF
    0.68
     OMITBAD
    0.67
    دانشنامهٔ
    0.64
     تضيفلها
    0.61
     мәкал
    0.60
    endphp
    0.58
     виправивши
    0.58
    ագրություններ
    0.58
     <<<<<<<<<<<<<<
    0.57
     Verſ
    0.57
    Activation Density: 0.308%

    No Known Activations