INDEX
    Explanations

    phrases indicating a cause-and-effect relationship

    phrases indicating outcomes or consequences

    New Auto-Interp
    Negative Logits
    crow
    -0.65
    tsky
    -0.65
    fil
    -0.61
     substr
    -0.61
    edo
    -0.59
     Neighbor
    -0.59
    parent
    -0.58
    pe
    -0.58
    kov
    -0.58
     parent
    -0.58
    POSITIVE LOGITS
    antly
    0.86
    aneously
    0.80
    inally
    0.78
    âĹ¼
    0.73
    INAL
    0.73
    ĸļ
    0.73
    ively
    0.72
    ENCY
    0.70
    ivating
    0.68
    UE
    0.67
    Act Density 0.043%

    No Known Activations