INDEX
    Explanations

    names of places and organizations

    Negative Logits
    Token            Logit
    "staking"        -0.62
    "roman"          -0.58
    " forgiving"     -0.56
    " lap"           -0.53
    " allowances"    -0.53
    " scratch"       -0.52
    "borne"          -0.52
    "pling"          -0.51
    " blazing"       -0.51
    " depress"       -0.50
    Positive Logits
    Token            Logit
    "a"              0.91
    "o"              0.86
    "icz"            0.85
    "opol"           0.83
    "iak"            0.82
    "i"              0.81
    "å"              0.81
    "vous"           0.80
    "din"            0.77
    "theless"        0.76
    Activation Density: 1.156%

    No Known Activations