    Explanations

    instances of the word "despite" indicating a contrast or contradiction

    Negative Logits
    Token            Logit
    " lampe"         -0.42
    " ldc"           -0.39
    "Русский"        -0.37
    " tomado"        -0.36
    "Bound"          -0.35
    " ön"            -0.34
    "下一个"          -0.33
    " pén"           -0.33
    "selaer"         -0.33
    "tagext"         -0.33

    Positive Logits
    Token            Logit
    " despite"       1.34
    "despite"        1.27
    " nonostante"    1.20
    " Despite"       1.19
    "Despite"        1.15
    "ostante"        1.13
    " malgré"        1.12
    " Malgré"        1.09
    " trotz"         1.09
    " رغم"           0.95
    Activation Density: 0.175%

    No Known Activations