INDEX
    Explanations

    instances of missing items or people

    New Auto-Interp
    Negative Logits
     InputDecoration
    -0.37
     Bewertung
    -0.35
     early
    -0.35
     Chatterjee
    -0.35
     Burch
    -0.35
     selection
    -0.34
    Enjoy
    -0.34
    ,
    -0.33
     analysis
    -0.33
     Sim
    -0.33
    POSITIVE LOGITS
     ſont
    0.65
     ſeinen
    0.61
     heartwarming
    0.58
     chi̍t
    0.57
     Geſch
    0.56
     دیکھیے
    0.56
     verſch
    0.55
    principalColumn
    0.55
     ſein
    0.54
     dieſer
    0.54
    Act Density 0.261%

    No Known Activations