INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    ronpa
    -0.53
    allan
    -0.52
     Max
    -0.51
     Grand
    -0.50
     numerus
    -0.49
     Moreau
    -0.49
    OGA
    -0.48
     GV
    -0.48
     BV
    -0.48
     Gregorio
    -0.48
    POSITIVE LOGITS
     Shirt
    1.21
    Shirt
    1.16
     shirt
    1.15
    Shirts
    1.04
    shirt
    1.02
     shirts
    1.00
     Shirts
    0.98
    shirts
    0.89
    hirt
    0.85
     Monfieur
    0.85
    Act Density 0.004%

    No Known Activations