INDEX
    Explanations

    names and references to people, specifically those involved in various artistic and entertainment contexts

    New Auto-Interp
    Negative Logits
    ryn
    -0.15
    irth
    -0.15
    ãĥ¼ãĥł
    -0.15
     GameController
    -0.14
    apult
    -0.14
    osp
    -0.14
    itori
    -0.14
    quine
    -0.13
    insula
    -0.13
     sprink
    -0.13
    POSITIVE LOGITS
     pat
    0.20
     Pat
    0.19
    _pat
    0.18
    Pat
    0.17
    .pat
    0.17
     ãĥij
    0.16
    ãĤº
    0.16
    WXYZ
    0.16
    pat
    0.15
    (pat
    0.15
    Act Density 0.025%

    No Known Activations