INDEX
    Explanations

    phrases comparing or diminishing one thing to another

    phrases that diminish the significance of something by suggesting it is "nothing more than" a trivial or lesser version of itself

    Negative Logits
    Token           Logit
    "ahime"         -0.83
    "ode"           -0.80
    "anta"          -0.77
    "enser"         -0.70
    " NCT"          -0.68
    "20439"         -0.65
    "oris"          -0.65
    "arts"          -0.65
    "oren"          -0.63
    "rones"         -0.62
    Positive Logits
    Token           Logit
    " mediocre"     0.70
    " rudimentary"  0.69
    " filler"       0.67
    "anke"          0.66
    " cosmetic"     0.65
    " subsistence"  0.64
    " superficial"  0.62
    " pure"         0.61
    " hors"         0.60
    " a"            0.59
    Act Density 0.059%

    No Known Activations