INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    ractable
    -0.71
    ghua
    -0.68
    íncia
    -0.68
     @"/
    -0.67
    sschutz
    -0.67
     Balth
    -0.67
     Arden
    -0.65
    tisp
    -0.64
     Organis
    -0.64
     Kaufmann
    -0.64
    POSITIVE LOGITS
    oo
    0.82
     Moos
    0.82
     hoo
    0.82
     TEE
    0.80
    OOT
    0.80
    oos
    0.80
    Doo
    0.80
     Loo
    0.79
     moo
    0.79
    hoo
    0.77
    Act Density 0.569%

    No Known Activations