    Negative Logits
    Token              Logit
    " hoof"            -0.07
    "/:"               -0.07
    "是不"             -0.07
    " broadcasting"    -0.07
    " view"            -0.07
    " voice"           -0.07
    " horsepower"      -0.07
    " orphan"          -0.07
    " основі"          -0.06
    " göre"            -0.06

    Positive Logits
    Token              Logit
    " settle"          0.17
    " settled"         0.17
    " settling"        0.15
    " settles"         0.11
    " Settlement"      0.11
    " settlement"      0.11
    " settlements"     0.09
    " resett"          0.08
    " Quyết"           0.07
                       0.07

    Activation Density: 0.006%

    No Known Activations