INDEX
    Explanations

    adjectives or verbs related to softening

    references to "soft" concepts, indicating a focus on gentle or less aggressive approaches

    New Auto-Interp
    Negative Logits
    ulhu
    -0.76
    reon
    -0.73
    agher
    -0.72
     Ancients
    -0.71
     Pax
    -0.70
     McKenna
    -0.69
    ICAN
    -0.66
    USS
    -0.65
    OUGH
    -0.65
     Blessed
    -0.65
    POSITIVE LOGITS
    ening
    1.18
    ball
    1.11
    ener
    1.09
    hearted
    1.01
     palate
    0.99
    eners
    0.98
    ened
    0.91
    cover
    0.89
    ens
    0.88
    balls
    0.88
    Act Density 0.016%

    No Known Activations