INDEX
    Explanations

    references to authors or creators associated with works

    New Auto-Interp
    Negative Logits
    oft
    -0.17
    大人
    -0.16
    sel
    -0.15
    otel
    -0.15
    illez
    -0.14
    ookie
    -0.14
    olley
    -0.14
    agged
    -0.14
    307
    -0.14
    owitz
    -0.14
    POSITIVE LOGITS
    tractive
    0.16
    uzu
    0.16
    zi
    0.15
    εβ
    0.15
    une
    0.15
     metav
    0.14
    AGER
    0.14
    é϶
    0.14
    ZERO
    0.14
     Champagne
    0.14
    Act Density 0.007%

    No Known Activations