INDEX
    Explanations

    conditions or hypothetical situations described with an "if" clause

    New Auto-Interp
    Negative Logits
    emis
    -0.62
    tten
    -0.59
    Born
    -0.58
    assadors
    -0.53
    ANI
    -0.51
    AMY
    -0.51
    atari
    -0.51
    tained
    -0.51
    agnetic
    -0.51
    atro
    -0.50
    POSITIVE LOGITS
     if
    2.91
     unless
    2.02
    if
    1.90
     If
    1.63
     IF
    1.58
    If
    1.58
    unless
    1.54
     whether
    1.40
     whenever
    1.34
     depending
    1.33
    Act Density 0.102%

    No Known Activations