INDEX
    Explanations

    geographical places and locations

    Negative Logits
                  -0.59
                  -0.50
                  -0.45
    <eos>         -0.40
     na           -0.39
     a            -0.39
     best         -0.34
     …            -0.34
    <strong>      -0.34
     sta          -0.33
    Positive Logits
     فريبيس            1.09
     дописавши         1.02
     Paglinawan        0.99
    UnusedPrivate      0.94
    ########.          0.93
     disponibilités    0.91
     Efq               0.90
    లాలు               0.88
    期刊论文           0.87
    AndEndTag          0.86
    Activation Density: 0.322%

    No Known Activations