INDEX
    Explanations

    expressions indicating a certain sentiment or attitude towards a situation or statement

    expressions indicating familiarity or mediocrity

    New Auto-Interp
    Negative Logits
    oulos
    -0.86
    perty
    -0.72
    edia
    -0.72
     VIDEOS
    -0.69
    IBLE
    -0.68
    oppers
    -0.64
    KS
    -0.62
    ilitary
    -0.62
    å§«
    -0.62
    çīĪ
    -0.61
    POSITIVE LOGITS
    este
    0.74
    ¹
    0.69
    cast
    0.69
    nered
    0.68
     grapes
    0.68
    hearted
    0.67
    etter
    0.65
    ling
    0.64
    lier
    0.64
    assy
    0.64
    Act Density 0.026%

    No Known Activations