INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    -0.07
    ecký
    -0.06
    Planet
    -0.06
    ная
    -0.06
          		
    -0.06
    ühl
    -0.06
                
    -0.06
    -0.06
     neby
    -0.06
     obstacles
    -0.06
    POSITIVE LOGITS
     firm
    0.21
     firms
    0.17
     Firm
    0.17
    firm
    0.11
    irms
    0.09
     firmalar
    0.09
     firma
    0.09
     Fir
    0.08
    fir
    0.07
     Frank
    0.07
    Act Density 0.006%

    No Known Activations