    Explanations

    phrases related to specific names or titles

    Negative Logits
    İstinadlar   -0.60
    Manbalar     -0.54
    weebly       -0.52
    livejournal  -0.47
     éclairage   -0.43
    sidemargin   -0.43
    Izvori       -0.43
    Prijs        -0.43
    NamedQuery   -0.43
    Gemeinsame   -0.43
    Positive Logits
     guir        0.68
     applau      0.67
     sappi       0.66
     jajaja      0.65
    trás         0.65
     apparti     0.63
     Ottobre     0.62
    ificance     0.62
    érêt         0.62
     sés         0.62
    Activation Density 0.277%

    No Known Activations