INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     numa
    -0.07
     on
    -0.07
    เพล
    -0.06
     Reynolds
    -0.06
     primeiro
    -0.06
     ob
    -0.06
     IPA
    -0.06
    FX
    -0.06
     likes
    -0.06
     lofty
    -0.06
    POSITIVE LOGITS
     bacteria
    0.11
     bacterial
    0.10
     bacter
    0.09
     бактер
    0.09
     institution
    0.08
     references
    0.08
    		            
    0.07
    fish
    0.07
    бач
    0.07
     yabancı
    0.07
    Act Density 0.011%

    No Known Activations