INDEX
    Explanations

    references to the name "Katie" or similar variations

    New Auto-Interp
    Negative Logits
    iro
    -0.17
    aws
    -0.17
    ts
    -0.15
    rett
    -0.15
    rams
    -0.15
    ç·Ĵ
    -0.15
    unu
    -0.15
    labs
    -0.14
    haven
    -0.14
     Popular
    -0.14
    POSITIVE LOGITS
     Perry
    0.16
    ungan
    0.16
    did
    0.15
    егоднÑı
    0.15
    meni
    0.14
     COUR
    0.14
    ISTIC
    0.14
     Cour
    0.14
     McCabe
    0.14
    ToUpper
    0.14
    Act Density 0.009%

    No Known Activations