INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     threw
    -0.07
    economic
    -0.07
     machinery
    -0.07
    Shim
    -0.07
    .PREFERRED
    -0.07
     spinning
    -0.07
    ору
    -0.07
    goods
    -0.07
     royalty
    -0.07
    -0.07
    POSITIVE LOGITS
     gros
    0.09
     sud
    0.08
    0.08
    inse
    0.08
     വരുന്ന
    0.08
     halve
    0.07
    0.07
     yapı
    0.07
     трав
    0.07
     Building
    0.07
    Act Density 0.021%

    No Known Activations