INDEX
    Explanations

    statements related to actions and expectations of a specific subject

    New Auto-Interp
    Negative Logits
     Blech
    -0.51
    τέλε
    -0.50
     saling
    -0.49
    ismet
    -0.48
    どうか
    -0.47
     بدان
    -0.46
     historical
    -0.44
     affaire
    -0.44
    ETHING
    -0.43
     our
    -0.43
    POSITIVE LOGITS
     himself
    1.48
    himself
    1.18
     Himself
    1.17
    SizeF
    0.89
     himſelf
    0.82
     his
    0.82
    因为他
    0.81
    ніципалі
    0.80
     Audiodateien
    0.75
    His
    0.74
    Act Density 0.326%

    No Known Activations