    Explanations

    questions that express capability or possibility

    Negative Logits
    мага        -0.16
    ullan       -0.15
    ode         -0.15
    owie        -0.15
    enders      -0.15
    Callable    -0.15
    ninger      -0.14
    asio        -0.14
    ikan        -0.14
    erty        -0.14
    Positive Logits
    inho        0.16
    ipeg        0.15
    inci        0.15
    IgnoreCase  0.14
    yaw         0.14
    mmc         0.13
    strument    0.13
     Misc       0.13
     Karlov     0.13
    Bias        0.13
    Activation Density: 0.026%

    No Known Activations