INDEX
    Explanations

    words related to reasoning, justification, and explanation

    phrases indicating the existence or state of being

    New Auto-Interp
    Negative Logits
     Inventory
    -0.76
    ÄŁ
    -0.75
    lez
    -0.72
    glers
    -0.71
    eks
    -0.70
     congratulations
    -0.69
    lash
    -0.65
     Highlights
    -0.65
    orage
    -0.65
    should
    -0.64
    POSITIVE LOGITS
     inherently
    0.91
     inconvenient
    0.89
     already
    0.89
     cheaper
    0.86
     intrinsically
    0.86
     sensitive
    0.85
     supposedly
    0.84
     deemed
    0.83
     inaccessible
    0.81
     unpopular
    0.80
    Act Density 0.316%

    No Known Activations