INDEX
    Explanations

    evaluative adjectives and descriptors that characterize objects or experiences

    Negative Logits
    Token                 Logit
    " the"                -1.19
    " CreateTagHelper"    -0.93
    " its"                -0.91
    " both"               -0.89
    " their"              -0.88
    " this"               -0.87
    " some"               -0.86
    " basically"          -0.86
    " those"              -0.85
    " our"                -0.83

    Positive Logits
    Token                 Logit
    ","                   1.04
    "ly"                  0.98
    "ized"                0.88
    " but"                0.87
    "yet"                 0.86
    " yet"                0.80
    "but"                 0.77
    " pero"               0.76
    "ally"                0.74
    " and"                0.72
    Activation Density: 2.898%

    No Known Activations