INDEX
    Explanations

    references to specific entities or items

    references to specific items or entities marked by the word "these."

    New Auto-Interp
    Negative Logits
    pless
    -0.74
    ppe
    -0.74
    orst
    -0.71
    esty
    -0.70
    achus
    -0.69
    ilet
    -0.67
    eling
    -0.66
    hood
    -0.64
    onis
    -0.63
    ë
    -0.63
    POSITIVE LOGITS
     particular
    0.78
     newfound
    0.78
     fellows
    0.75
     findings
    0.72
     kinds
    0.72
     latter
    0.69
     sorts
    0.67
     elusive
    0.66
     nifty
    0.65
     pesky
    0.65
    Act Density 0.058%

    No Known Activations