INDEX
    Explanations

    proper names and terms in German

    Negative Logits
    "theless"         -0.76
    " compulsion"     -0.69
    " captcha"        -0.68
    " shockingly"     -0.64
    " VIDE"           -0.63
    " bullies"        -0.62
    " microw"         -0.61
    " duplication"    -0.60
    "inarily"         -0.60
    " bonded"         -0.60
    Positive Logits
    "liga"            0.97
    " Mé"             0.90
    " É"              0.88
    " qui"            0.86
    " Univers"        0.85
    "usalem"          0.85
    "arten"           0.81
    "士"              0.81
    "É"               0.81
    "aceae"           0.79
    Activation Density: 0.154%

    No Known Activations