INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     autorytatywna
    -0.91
     Roskov
    -0.85
     estekak
    -0.85
     doInBackground
    -0.84
    BeginInit
    -0.84
    istoitu
    -0.81
    WebElementEntity
    -0.78
    Jeografia
    -0.77
    TagMode
    -0.77
    ConstraintMaker
    -0.77
    POSITIVE LOGITS
     reference
    0.93
     check
    0.79
     control
    0.69
     references
    0.65
     training
    0.59
     refer
    0.56
     yards
    0.55
    reference
    0.54
    check
    0.54
     comparison
    0.54
    Act Density 0.003%

    No Known Activations