INDEX
    Explanations

    phrases indicating repetition or nostalgia

    New Auto-Interp
    Negative Logits
    isÃŃ
    -0.17
    ãĥķãĤ§
    -0.17
    urum
    -0.16
    erring
    -0.15
    ondon
    -0.14
     Marvin
    -0.14
    erland
    -0.14
    enticator
    -0.14
    ozor
    -0.14
    oksen
    -0.14
    POSITIVE LOGITS
     again
    0.32
    again
    0.26
     Again
    0.24
     repeat
    0.23
    Again
    0.23
     повÑĤоÑĢ
    0.21
     Repeat
    0.21
     novamente
    0.21
     lại
    0.20
    AGAIN
    0.20
    Act Density 0.204%

    No Known Activations