INDEX
    Explanations

    conjunctions and words indicating contrast or condition

    New Auto-Interp
    Negative Logits
     Pratt
    -0.16
    ndon
    -0.14
    assi
    -0.14
    EditingStyle
    -0.14
    ecko
    -0.14
    ):?>↵
    -0.14
    illo
    -0.13
    ÑĢиÑĩ
    -0.13
    otechn
    -0.13
     hence
    -0.13
    POSITIVE LOGITS
    óc
    0.15
    ustin
    0.14
    lew
    0.14
     Dia
    0.14
     Flesh
    0.14
    ãĥ©ãĤ¤ãĥĪ
    0.13
     Traditional
    0.13
    LEC
    0.13
    å§¿
    0.13
    ene
    0.13
    Act Density 0.124%

    No Known Activations