INDEX
    Explanations

    coordinates, paying, Employee, Risk, time, repel

    Negative Logits
    ivă        0.46
    ERNAL      0.45
     Лондон    0.42
    PDF        0.41
    Heb        0.40
    attaa      0.40
    London     0.39
    ICK        0.39
    DC         0.38
    UEN        0.38
    Positive Logits
     suv       0.55
     bonne     0.51
     dün       0.50
     adatt     0.50
     tanpa     0.49
     genial    0.48
     git       0.48
     soins     0.48
     pow       0.48
     imposs    0.48
    Activation Density: 0.068%

    No Known Activations