INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     we
    -0.59
     kita
    -0.47
     and
    -0.47
     nous
    -0.45
     I
    -0.44
     azt
    -0.44
     bross
    -0.43
     our
    -0.43
     pollutants
    -0.43
     нами
    -0.43
    POSITIVE LOGITS
     CreateTagHelper
    1.02
     autorytatywna
    0.91
    ftagPool
    0.82
     initComponents
    0.82
    findpost
    0.81
    ✨:
    0.78
    )";
    
    0.75
    ihnachts
    0.74
    +:+
    0.74
    oredCriteria
    0.74
    Act Density 0.055%

    No Known Activations