INDEX
    Explanations

    the conditional word "if"

    New Auto-Interp
    Negative Logits
    chester
    -1.72
     credibility
    -1.57
    idian
    -1.56
     deterg
    -1.50
     antigens
    -1.49
    >&
    -1.45
     vote
    -1.44
    ¿½
    -1.44
    erred
    -1.41
    atrix
    -1.40
    POSITIVE LOGITS
    simpl
    1.52
     Photograph
    1.51
    glass
    1.50
    leine
    1.49
    zes
    1.47
    shoot
    1.47
    own
    1.44
    example
    1.44
     objection
    1.42
    enÃŃ
    1.41
    Act Density 3.696%

    No Known Activations