INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     torso
    -0.07
    Dou
    -0.06
    -0.06
     Booster
    -0.06
     Dol
    -0.06
     Vendor
    -0.06
    Раз
    -0.06
     stiff
    -0.06
     Elves
    -0.06
     apprent
    -0.06
    POSITIVE LOGITS
    -path
    0.09
     Path
    0.09
    case
    0.08
     searcher
    0.07
     roadmap
    0.07
     path
    0.07
     ناب
    0.07
     тому
    0.07
    CLUDING
    0.07
     migrate
    0.06
    Act Density 0.008%

    No Known Activations