INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    Badger
    -0.76
     Badger
    -0.74
     Monfieur
    -0.71
     araw
    -0.67
     Karin
    -0.66
     Coney
    -0.66
     Shal
    -0.66
     Manfred
    -0.65
     münchen
    -0.65
     sauer
    -0.65
    POSITIVE LOGITS
     It
    1.50
     it
    1.46
    It
    1.40
     its
    1.22
    1.19
     它
    1.18
    Its
    1.16
     Its
    1.12
    abestanden
    1.08
     IT
    1.05
    Act Density 0.283%

    No Known Activations