INDEX
    Explanations

    words related to emphasizing importance, necessity, or highlighting

    phrases indicating urgency or necessity in various contexts

    New Auto-Interp
    Negative Logits
     underwear
    -0.71
     Cups
    -0.67
    slave
    -0.67
     alone
    -0.65
     Britann
    -0.63
     slaves
    -0.61
     bye
    -0.60
    apps
    -0.59
     Hair
    -0.59
     Noel
    -0.59
    POSITIVE LOGITS
     importance
    0.92
    ItemImage
    0.80
     convergence
    0.77
     resilience
    0.77
     heroism
    0.76
     hypocrisy
    0.75
     absurdity
    0.75
     similarities
    0.75
    undrum
    0.74
     dich
    0.74
    Act Density 0.491%

    No Known Activations