    Negative Logits
     브랜드    -0.08
    sof    -0.08
     Fabr    -0.08
     aset    -0.08
     heightened    -0.08
     hohen    -0.08
    ปี    -0.07
     telev    -0.07
     spol    -0.07
     Turkey    -0.07
    Positive Logits
    ו�    0.08
     სა�    0.08
    0.08
    0.08
    licas    0.08
     مطلوب    0.08
    Categories    0.08
     რაი    0.08
    ereco    0.08
    ̊    0.08
    Act Density 0.006%

    No Known Activations