INDEX
    Explanations

    instances of the word "twice" and other related numerical repetitions

    New Auto-Interp
    Negative Logits
    ÙĥÙħ
    -0.16
    list
    -0.15
    shal
    -0.15
    reg
    -0.15
    ent
    -0.15
    ebra
    -0.15
    reta
    -0.15
     Meer
    -0.15
    .radioButton
    -0.14
    svc
    -0.14
    POSITIVE LOGITS
    oldur
    0.20
    缮ãģ®
    0.16
    opard
    0.16
    RDD
    0.15
    ograd
    0.15
    fold
    0.15
    /qu
    0.15
    Łèĥ½
    0.15
    ushima
    0.14
    isque
    0.14
    Act Density 0.030%

    No Known Activations