INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     fees
    -0.09
    -0.08
     forage
    -0.08
     smartphones
    -0.08
     Grosso
    -0.08
    дых
    -0.08
     penalty
    -0.08
     असे
    -0.07
     photos
    -0.07
     frais
    -0.07
    POSITIVE LOGITS
     bipolar
    0.10
     Sarajevo
    0.10
     Bikini
    0.09
     Belfast
    0.09
     Dayton
    0.08
    PART
    0.08
     stal
    0.08
    共和
    0.08
     tear
    0.08
     civile
    0.08
    Act Density 0.006%

    No Known Activations