INDEX
    Explanations
    Negative Logits
     motor          -0.75
    motor           -0.71
    amoan           -0.63
     Motor          -0.60
     motors         -0.60
    romechanical    -0.58
    Спасылкі        -0.58
     MOTOR          -0.57
    oflavin         -0.56
     propOrder      -0.56
    Positive Logits
    awtextra        0.68
     isolado        0.57
     casero         0.57
     caseros        0.54
     superiores     0.53
     ménage         0.50
    wpi             0.50
     wieś           0.48
                    0.47
     isolada        0.47
    Activation Density 0.016%

    No Known Activations