INDEX
    Explanations
    Negative Logits
     Anh        -0.08
    836         -0.08
     Crisp      -0.08
     Zur        -0.07
     Kurd       -0.07
     Pry        -0.07
     Duke       -0.07
     Odds       -0.07
    &type       -0.07
    itaine      -0.07
    Positive Logits
                0.08
    -handed     0.08
     опера      0.08
    oct         0.08
    war         0.08
                0.08
    ър          0.07
     merc       0.07
    landing     0.07
     платеж     0.07
    Activation Density 0.010%

    No Known Activations