INDEX
    Explanations

    proper nouns, specifically names

    New Auto-Interp
    Negative Logits
     Hild
    -0.68
    hift
    -0.67
    Kira
    -0.65
     Sarg
    -0.64
     soeur
    -0.63
     chipping
    -0.63
    Becker
    -0.63
    1
    -0.62
     bic
    -0.62
     petal
    -0.61
    POSITIVE LOGITS
     POU
    1.20
     Vou
    1.16
     LOU
    1.14
     Cou
    1.13
     Gou
    1.13
     Kou
    1.13
     Cougars
    1.13
     rou
    1.12
     gou
    1.10
     MOU
    1.08
    Act Density 0.157%

    No Known Activations