    Explanations

    words related to derogatory remarks or negative connotations

    words related to the concept of meaning, specifically through the prefix "dem" and variations thereof

    Negative Logits
    Token              Logit
    " multif"          -0.70
    "ModLoader"        -0.66
    " lodging"         -0.60
    " strawberries"    -0.59
    " Bengal"          -0.59
    " Annotations"     -0.59
    " mids"            -0.58
    " ILCS"            -0.57
    " fishing"         -0.57
    " pond"            -0.56
    Positive Logits
    Token              Logit
    "ufact"            1.07
    "oppers"           0.79
    "iewicz"           0.79
    "agement"          0.74
    "kas"              0.72
    "ESA"              0.70
    "eanor"            0.69
    "oppable"          0.69
    "rial"             0.68
    "esson"            0.68
    Activation Density: 0.107%

    No Known Activations