INDEX
    Explanations

    Phrases that indicate necessity or emphasize the importance of actions or concepts.

    Negative Logits
    Token               Logit
    "IVEREF"            -0.86
    " esternos"         -0.71
    " poffe"            -0.68
    "addCriterion"      -0.64
    " oprot"            -0.62
    " виправивши"       -0.60
    "ázaro"             -0.60
    " autorytatywna"    -0.60
    "Jereo"             -0.58
    "PyExc"             -0.58
    Positive Logits
    Token               Logit
    " Нужно"            0.83
    " faut"             0.83
    " bisogna"          0.81
    " Надо"             0.78
    " perlu"            0.77
    "Надо"              0.76
    " trzeba"           0.74
    " occorre"          0.73
    " należy"           0.72
    " следует"          0.72
    Activation Density: 0.410%

    No Known Activations