INDEX
    Explanations

    periods at the end of sentences

    Negative Logits
    Token            Logit
    " charism"       -0.69
    " clipboard"     -0.68
    " glim"          -0.66
    "cientious"      -0.64
    " glasses"       -0.62
    " Hitman"        -0.60
    " Stoke"         -0.59
    " Shinra"        -0.59
    " mund"          -0.58
    " slate"         -0.57
    Positive Logits
    Token            Logit
    "ctuary"         0.72
    "jong"           0.72
    "lopp"           0.71
    "%%"             0.67
    " Authors"       0.67
    "vae"            0.67
    " Spons"         0.67
    "tm"             0.66
    "jas"            0.66
    "aneously"       0.65
    Activation Density: 0.244%

    No Known Activations