INDEX
    Explanations

    adjectives related to characteristics or attitudes

    terms and phrases expressing emotional intensity and outward expressions

    New Auto-Interp
    Negative Logits
     Tanz
    -0.70
    PLIED
    -0.65
     ãĤµãĥ¼ãĥĨãĤ£ãĥ¯ãĥ³
    -0.65
    noon
    -0.62
     Bastard
    -0.62
     Credits
    -0.61
    ammy
    -0.60
    Random
    -0.59
    olicy
    -0.58
    Frameworks
    -0.58
    POSITIVE LOGITS
    iations
    0.93
    ously
    0.84
    ous
    0.83
    atively
    0.82
    ologic
    0.78
    ographed
    0.77
    uously
    0.76
    iation
    0.74
    hips
    0.74
    hound
    0.72
    Act Density 0.097%

    No Known Activations