INDEX
    Negative Logits
        Token            Logit
        " Hieronymus"    -0.71
        " Annika"        -0.67
        " Karsten"       -0.64
        " Katja"         -0.63
        " Shakspeare"    -0.63
        "Jörg"           -0.62
        " Jörg"          -0.61
        " pageIndex"     -0.61
        " michelle"      -0.61
        " Giovanna"      -0.61
    Positive Logits
        Token            Logit
        " kac"           1.05
        " Portail"       0.95
        " panik"         0.94
        " karton"        0.93
        " kram"          0.90
        " makro"         0.88
        " Dés"           0.88
        " seksi"         0.88
        " usta"          0.85
        " Chá"           0.85
    Activation Density: 0.323%

    No Known Activations