INDEX
    Explanations

    phrases that express caution or warnings related to beliefs and decision-making processes

    New Auto-Interp
    Negative Logits
    uga
    -0.22
    ierz
    -0.15
     sar
    -0.15
    ago
    -0.14
     Grip
    -0.14
    ansa
    -0.14
    ÄĽt
    -0.14
    vn
    -0.14
    .outer
    -0.14
    ock
    -0.14
    POSITIVE LOGITS
    \Active
    0.15
    ео
    0.15
    requires
    0.15
     phải
    0.14
    rut
    0.14
    aidu
    0.14
    Requires
    0.14
     col
    0.14
    needs
    0.14
     Requires
    0.14
    Act Density 0.180%

    No Known Activations