INDEX
    Explanations

    descriptions related to superficiality and cliché in character representation

    New Auto-Interp
    Negative Logits
    ResponseBody
    -0.16
    ephir
    -0.15
    atham
    -0.15
     vidéos
    -0.15
    umo
    -0.14
     bordel
    -0.14
    avax
    -0.14
    ****************************
    -0.14
    ikh
    -0.14
    638
    -0.14
    POSITIVE LOGITS
     hack
    0.26
     kits
    0.25
     artificial
    0.24
     syrup
    0.24
     sac
    0.23
     tack
    0.23
    hack
    0.22
     chees
    0.22
     corn
    0.22
    kits
    0.21
    Act Density 0.498%

    No Known Activations