INDEX
    Explanations

    although, while, though

    Negative Logits
    " whatsoever"  0.46
    " aşağıdaki"  0.46
    " malgré"  0.43
    " Malgré"  0.43
    " مهما"  0.40
    " निम्नलिखित"  0.39
    " assolutamente"  0.39
    "orough"  0.39
    "ської"  0.38
    " 👌"  0.38
    Positive Logits
    "雖然"  0.75
    "虽然"  0.74
    "Normally"  0.71
    " Normally"  0.68
    "aunque"  0.67
    " although"  0.64
    " indirectly"  0.64
    "Although"  0.63
    "While"  0.63
    " Хотя"  0.61
    Act Density 0.010%

    No Known Activations