INDEX
    Explanations

    proper nouns, particularly names and geographical entities

    New Auto-Interp
    Negative Logits
     للمعارف
    -0.86
    AndEndTag
    -0.65
    حياتها
    -0.63
    ніципалі
    -0.56
    yntaxException
    -0.56
     дописавши
    -0.56
     ostavi
    -0.56
    forChild
    -0.56
    Fordítás
    -0.54
     Himo
    -0.52
    POSITIVE LOGITS
    Välislingid
    0.48
     võ
    0.47
     peaks
    0.46
     mõ
    0.45
     eel
    0.44
     rõ
    0.44
     Võ
    0.43
     amet
    0.43
    Viited
    0.42
     või
    0.42
    Act Density 0.109%

    No Known Activations