INDEX
    Explanations

    unique tokens or identifiers in the text

    New Auto-Interp
    Negative Logits
    のですね
    -0.70
     sobretudo
    -0.68
    Obrigada
    -0.67
     dunque
    -0.66
     loveliness
    -0.65
     marvellous
    -0.65
     пожалуйста
    -0.63
     honourable
    -0.63
     Pls
    -0.63
     lovely
    -0.61
    POSITIVE LOGITS
     azz
    0.60
     gettin
    0.60
     wife
    0.59
    Identyfik
    0.59
     dang
    0.59
     bezeichneter
    0.56
     lookin
    0.56
     outta
    0.54
    Wife
    0.54
     stocker
    0.54
    Act Density 0.341%

    No Known Activations