INDEX
    Explanations

    selecting options or items

    New Auto-Interp
    Negative Logits
    certain
    -1.10
    Certain
    -0.96
     fhew
    -0.96
     рассматри
    -0.92
    场比赛
    -0.88
     erstmal
    -0.85
    nch
    -0.85
     链
    -0.85
    mathrm
    -0.85
     卡片
    -0.82
    POSITIVE LOGITS
     one
    2.41
     ONE
    1.53
     appropriate
    1.34
     which
    1.31
     only
    1.28
    ONE
    1.20
    One
    1.13
     один
    1.13
     option
    1.10
     best
    1.08
    Act Density 0.023%

    No Known Activations