INDEX
    Explanations

    specific entities mentioned in the document, such as product names, book titles, or locations

    elements related to specific content categories and media references

    New Auto-Interp
    Negative Logits
     opposite
    -0.62
     mush
    -0.59
     ``
    -0.58
     lit
    -0.56
     comparatively
    -0.54
     fertile
    -0.51
     forgiving
    -0.51
     ''
    -0.51
     theirs
    -0.51
     ÏĦ
    -0.51
    POSITIVE LOGITS
    anmar
    1.02
    resa
    0.98
    odore
    0.97
    agascar
    0.86
    foundland
    0.86
    pherd
    0.82
    bidden
    0.82
    romeda
    0.80
    jamin
    0.74
    xiety
    0.73
    Act Density 0.911%

    No Known Activations