INDEX
    Explanations
    Negative Logits
    Token         Logit
    " if"         -1.23
    " there"      -1.16
    " but"        -1.03
    " this"       -1.00
    " quien"      -0.98
    " for"        -0.95
    " If"         -0.95
                  -0.94
    "This"        -0.91
    " Dieser"     -0.90
    Positive Logits
    Token         Logit
    " his"        2.69
    " its"        2.03
    " was"        1.68
    " jego"       1.55
    " его"        1.50
    " is"         1.49
                  1.45
    " has"        1.44
    " suoi"       1.43
    " अपनी"       1.34
    Activation Density: 0.044%

    No Known Activations