INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     foam
    -0.07
     싱글
    -0.06
     поли
    -0.06
    -commercial
    -0.06
     fracture
    -0.06
    Poster
    -0.06
     decided
    -0.06
    vak
    -0.05
    gL
    -0.05
    warts
    -0.05
    POSITIVE LOGITS
    �장
    0.07
     разви
    0.07
     الشيخ
    0.07
     Earlier
    0.07
    closing
    0.07
     appliance
    0.07
     Аб
    0.07
     Mathematical
    0.06
     Naples
    0.06
     関連
    0.06
    Act Density 0.041%

    No Known Activations