INDEX
    Explanations

    proper nouns

    Negative Logits
    " apart"       -0.46
    "niem"         -0.45
    " Recife"      -0.45
                   -0.44
    " Bochum"      -0.42
    " Úl"          -0.42
    " effected"    -0.42
    " reik"        -0.42
    " byz"         -0.42
    " alps"        -0.42
    Positive Logits
    "Datuak"              0.88
    "Демографія"          0.69
    " AssemblyTitle"      0.65
    "tagHelperRunner"     0.65
    "enterOuterAlt"       0.64
    " pinulongan"         0.63
    "WarningLevel"        0.61
    "Билгалдахарш"        0.61
    "IGraphics"           0.60
    " متعلقه"             0.59
    Activation Density: 0.001%

    No Known Activations