INDEX
    Explanations

    phrases related to specifying locations or conditions

    phrases indicating conditions or states related to systems

    Negative Logits
    Token        Logit
    "luaj"       -0.77
    "ode"        -0.71
    "AMY"        -0.70
    "Others"     -0.69
    "english"    -0.64
    " Others"    -0.64
    "tro"        -0.62
    "uc"         -0.62
    "incial"     -0.60
    "anton"      -0.59
    Positive Logits
    Token          Logit
    " whenever"    1.43
    " if"          1.31
    " whoever"     1.20
    " each"        1.13
    " unless"      1.09
    " when"        1.09
    " whichever"   1.08
    " every"       1.00
    " suppose"     0.97
    " unlike"      0.96
    Activation Density: 0.320%

    No Known Activations