INDEX
    Explanations

    phrases that reference outcomes or conclusions

    New Auto-Interp
    Negative Logits
     zwiſchen
    -0.73
     beſch
    -0.68
    niſſe
    -0.67
    ſchen
    -0.65
     feroit
    -0.63
     verſch
    -0.61
     fashiola
    -0.61
    ſchaft
    -0.61
     dieſem
    -0.60
     queſto
    -0.60
    POSITIVE LOGITS
     result
    1.01
     Ergebnis
    0.78
    result
    0.77
     RESULT
    0.77
     outcome
    0.77
     Result
    0.76
     resultado
    0.75
     resulting
    0.74
     results
    0.73
     Resultat
    0.68
    Act Density 0.038%

    No Known Activations