INDEX
    Explanations

    references to specific objects or physical locations

    Negative Logits
    "idates"          -0.82
    " furthermore"    -0.79
    "rity"            -0.78
    "erity"           -0.76
    " moreover"       -0.73
    "icals"           -0.71
    " additionally"   -0.69
    "hess"            -0.69
    "fficient"        -0.67
    "iencies"         -0.66
    Positive Logits
    " proverbial"     0.86
    " sponge"         0.86
    " steroids"       0.78
    " miniature"      0.77
    " magnet"         0.75
    " Gest"           0.74
    " aspirin"        0.73
    "yip"             0.73
    " heartbeat"      0.72
    " spaghetti"      0.72
    Activation Density: 1.655%

    No Known Activations