INDEX
    Explanations

    punctuation and formatting in the text

    New Auto-Interp
    Negative Logits
    ãģıãĤĮ
    -0.14
    inoa
    -0.13
    andre
    -0.13
    æħ§
    -0.13
    thew
    -0.13
     Blowjob
    -0.13
    jaw
    -0.12
    exact
    -0.12
     surre
    -0.12
    éĢļ
    -0.12
    POSITIVE LOGITS
    Looper
    0.15
    ÑĥÑĢг
    0.14
    ENU
    0.14
    863
    0.13
    cket
    0.13
    roach
    0.13
    unicorn
    0.13
    observ
    0.13
    onaut
    0.13
    xea
    0.13
    Act Density 1.852%

    No Known Activations