INDEX
    Explanations

    irregularly contracted words that include an apostrophe

    words and phrases related to negative emotions or harmful actions

    New Auto-Interp
    Negative Logits
     Reef
    -0.76
     Leilan
    -0.73
     Kris
    -0.71
     KL
    -0.69
     Flavoring
    -0.69
     KS
    -0.65
     KP
    -0.65
     Ruler
    -0.64
     Kirin
    -0.64
     Ninth
    -0.63
    POSITIVE LOGITS
    vernment
    0.98
    usterity
    0.87
    ploy
    0.87
    ardless
    0.81
    selves
    0.79
    acters
    0.79
    ï¸ı
    0.79
    actly
    0.78
    roc
    0.78
    ancial
    0.77
    Act Density 0.192%

    No Known Activations