INDEX
    Explanations

    expressions indicating awareness and understanding of situations

    Negative Logits
    " Monfieur"           -0.98
    (token not shown)     -0.86
    "AddAttribute"        -0.82
    " dAtA"               -0.79
    " Shakspeare"         -0.79
    "CompleteListener"    -0.78
    " Pokies"             -0.76
    " XNUMX"              -0.75
    " poffible"           -0.71
    " pleaſure"           -0.71
    Positive Logits
    " know"               1.15
    " knows"              1.12
    "know"                1.05
    "Know"                0.92
    " Know"               0.92
    " knew"               0.90
    " understands"        0.89
    "knows"               0.88
    " recognize"          0.88
    " understand"         0.87
    Activation Density: 0.281%

    No Known Activations