INDEX
    Explanations

    phrases related to intentions or actions of individuals

    Negative Logits
    Token          Logit
    " depic"       -1.15
    " encomp"      -1.12
    " inev"        -1.10
    " disagre"     -1.09
    " apprehen"    -1.05
    " „,"          -1.05
    " increa"      -1.04
    " Juf"         -1.04
    " hcm"         -1.04
    " alre"        -1.03
    Positive Logits
    Token          Logit
    " himself"     1.31
    " his"         1.12
    "himself"      1.08
    " Himself"     0.90
    "his"          0.90
    " he"          0.75
    "His"          0.74
    " His"         0.72
    " seiner"      0.72
    " seinem"      0.70
    Activation Density: 0.543%

    No Known Activations