INDEX
    Explanations

    adjectives expressing opinions or evaluations about things or people

    words related to characterization and opinions

    New Auto-Interp
    Negative Logits
    ammy
    -0.66
    hoff
    -0.64
    cum
    -0.64
    nir
    -0.64
    perty
    -0.62
    neau
    -0.60
    stration
    -0.60
    externalActionCode
    -0.59
    kr
    -0.59
    isoft
    -0.59
    POSITIVE LOGITS
    phas
    0.94
     ourselves
    0.83
     oneself
    0.79
    encies
    0.78
     it
    0.77
    Ī
    0.77
    enance
    0.76
    pointers
    0.74
     themselves
    0.74
    tones
    0.73
    Act Density 0.199%

    No Known Activations