INDEX
    Explanations

    words related to emotions or actions emphasizing a sense of intensity or importance

    words related to different forms of the word "enemies" or expressions of opposition

    New Auto-Interp
    Negative Logits
    yip
    -0.77
    apest
    -0.75
    é¾įå
    -0.75
    jri
    -0.73
    cules
    -0.62
    ancial
    -0.60
     incon
    -0.60
    appropriate
    -0.59
    hops
    -0.58
    swick
    -0.57
    POSITIVE LOGITS
    ciating
    1.08
    achment
    0.98
    enment
    0.89
    ãĤ¨ãĥ«
    0.75
    emy
    0.73
    ĸļ
    0.73
    yll
    0.67
    iasm
    0.67
    emies
    0.65
    ached
    0.64
    Act Density 0.065%

    No Known Activations