INDEX
    Explanations

    phrases indicating capability or ability to perform tasks

    New Auto-Interp
    Negative Logits
     userSchema
    -0.72
    WireFormat
    -0.69
     velkommen
    -0.60
     Paglinawan
    -0.58
     quema
    -0.58
    Burns
    -0.58
    бище
    -0.57
    Kraj
    -0.57
     Majefty
    -0.57
     Numerology
    -0.57
    POSITIVE LOGITS
     can
    1.19
     Can
    1.15
    Can
    1.03
     able
    1.03
    又能
    0.93
     ability
    0.92
    can
    0.91
     CAN
    0.90
    ecan
    0.89
    0.89
    Act Density 0.126%

    No Known Activations