INDEX
    Explanations

    the word "any" followed by an adjective or a verb

    phrases related to universally accepted judgments or assessments

    New Auto-Interp
    Negative Logits
    endas
    -0.94
    redients
    -0.72
    ributes
    -0.71
    appings
    -0.71
    combe
    -0.70
    oute
    -0.70
    rex
    -0.69
    icz
    -0.66
    rea
    -0.66
    ros
    -0.66
    POSITIVE LOGITS
    THING
    1.23
     conceivable
    1.10
     given
    1.03
     imaginable
    1.01
     semblance
    0.92
     sane
    0.90
    body
    0.88
    how
    0.87
     kind
    0.86
    ONE
    0.86
    Act Density 0.065%

    No Known Activations