INDEX
    Explanations

    questions or inquiries regarding reasons and explanations

    "why" followed by a pronoun

    New Auto-Interp
    Negative Logits
    Them
    -0.58
     Cæsar
    -0.58
     vielä
    -0.56
     alfo
    -0.56
     Makefile
    -0.54
     Pelop
    -0.53
    Makefile
    -0.52
    stdc
    -0.52
    šak
    -0.50
     Roskov
    -0.49
    POSITIVE LOGITS
     we
    1.50
     they
    1.47
     there
    1.35
     the
    1.17
     it
    1.12
     you
    1.11
     he
    1.10
     someone
    0.94
     things
    0.92
     some
    0.92
    Act Density 0.478%

    No Known Activations