INDEX
    Explanations

    especially if or because

    New Auto-Interp
    Negative Logits
     afin
    -0.13
     aby
    -0.10
     ÑĩÑĤобÑĭ
    -0.10
    æīįèĥ½
    -0.09
     Ñīоб
    -0.09
    cela
    -0.09
     chứ
    -0.09
     rather
    -0.09
     deÄŁil
    -0.09
    askell
    -0.08
    POSITIVE LOGITS
     especially
    0.23
     because
    0.21
     compared
    0.21
    especially
    0.18
     Especially
    0.17
    because
    0.16
    åĽłä¸º
    0.15
     оÑģобенно
    0.15
     Because
    0.15
    ï¼ĮåĽłä¸º
    0.14
    Act Density 0.081%

    No Known Activations