INDEX
    Explanations

    temporal expressions indicating specific times or conditions

    Negative Logits
     Просто    -0.48
    Simple     -0.47
    long       -0.47
    Long       -0.45
    Real       -0.45
     echter    -0.44
    uoš        -0.44
    вите       -0.43
     Long      -0.42
    Pure       -0.42
    Positive Logits
     propOrder          1.05
     importantly        0.83
    SequentialGroup     0.81
     للاسماء            0.80
     שוליים             0.78
     CreateTagHelper    0.76
    InstrumentedTest    0.74
    何より              0.72
    jmniej              0.71
    رشف                 0.69
    Activation Density: 0.147%

    No Known Activations