INDEX
    Explanations

    verbs indicating change or development

    New Auto-Interp
    Negative Logits
    yet
    -0.75
    initely
    -0.73
    psey
    -0.70
     respectively
    -0.69
     generally
    -0.68
     preserved
    -0.67
    matically
    -0.67
     preserves
    -0.66
    Generally
    -0.64
     inherently
    -0.64
    POSITIVE LOGITS
    mega
    0.74
    rier
    0.73
    ounters
    0.72
    inders
    0.69
    ORED
    0.69
     NETWORK
    0.69
    ORN
    0.69
    ackle
    0.67
    VALUE
    0.65
    Ô
    0.65
    Act Density 0.121%

    No Known Activations