INDEX
    Explanations

    sentences that express contrast or conditional statements

    Negative Logits
    ucha      -0.06
    ASM       -0.06
    Stamp     -0.06
    antis     -0.06
     promise  -0.06
    illo      -0.06
    æĮ¯       -0.06
    çĶŁåij½    -0.06
    گرد       -0.06
    bable     -0.06
    Positive Logits
     competition  0.07
     Competition  0.07
    ervals        0.07
    competition   0.07
     foc          0.07
    ripper        0.07
    รม            0.06
     Wich         0.06
     competit     0.06
     busiest      0.06
    Activation Density: 0.025%

    No Known Activations