INDEX
    Explanations

    instances of the word "right"

    New Auto-Interp
    Negative Logits
    aples
    -0.70
    igmat
    -0.64
    arette
    -0.64
     cellul
    -0.60
    idated
    -0.60
    âĢ¢
    -0.59
    ulative
    -0.59
    ĸļ
    -0.58
    raved
    -0.58
    advert
    -0.58
    POSITIVE LOGITS
    eous
    1.09
     smack
    0.91
     fielder
    0.84
     wing
    0.75
    move
    0.71
     winger
    0.69
    ward
    0.68
    ocrin
    0.67
    Ĥİ
    0.66
    behind
    0.66
    Act Density 0.025%

    No Known Activations