    Explanations

    instances of conversational phrases or conditional statements in discussions about relationships

    Negative Logits
    aur         -0.15
    ebin        -0.15
     whereas    -0.14
    æīį         -0.14
    rud         -0.14
    abaj        -0.14
     Whereas    -0.14
    urat        -0.13
     Prec       -0.13
     probably   -0.13
    Positive Logits
     varsa      0.18
     _______,   0.17
    ROKE        0.16
     >",        0.15
    rag         0.15
    nosti       0.14
     quieres    0.14
    roke        0.14
     yoksa      0.14
    @",         0.14
    Activation Density 0.105%

    No Known Activations