INDEX
    Explanations
    Negative Logits
    -0.77
    migrationBuilder
    -0.75
     desmotivaciones
    -0.73
     zoude
    -0.72
     Geiſt
    -0.71
     ſeine
    -0.70
     queſta
    -0.68
     müſſen
    -0.68
     laſſen
    -0.68
    AISSEE
    -0.68
    Positive Logits
    0.44
    !
    0.43
    ________________
    0.42
    ↵↵↵
    0.42
      
    0.40
    0.38
       
    0.35
    ldots
    0.35
    _
    0.35
    io
    0.34
    Activation Density 0.003%

    No Known Activations