INDEX
    Explanations

    dialogue and conversational exchanges

    Comes after a personal pronoun

    New Auto-Interp
    Negative Logits
     obviamente
    -0.75
     itſelf
    -0.74
     tangerang
    -0.72
     fieldNum
    -0.70
     bekasi
    -0.70
    tacular
    -0.70
     awesome
    -0.69
    awesome
    -0.68
     poffe
    -0.66
     Awesome
    -0.66
    POSITIVE LOGITS
    djangoproject
    0.52
     queer
    0.47
     verás
    0.47
    шень
    0.46
     bruk
    0.46
     plenty
    0.46
     fellows
    0.45
     gorra
    0.45
    Mamma
    0.44
     idee
    0.44
    Act Density 0.175%

    No Known Activations