INDEX
    Explanations

    temporal markers or references to specific times and dates

    New Auto-Interp
    Negative Logits
     ویکی‌پدی
    -0.41
    imageshack
    -0.40
     الرياضيه
    -0.40
    alp
    -0.40
    lorette
    -0.39
    ientôt
    -0.39
    bu
    -0.38
    참고
    -0.38
    tagHelperRunner
    -0.38
    Predecesor
    -0.38
    POSITIVE LOGITS
    SequentialGroup
    0.57
    0.52
     uſe
    0.52
     whoſe
    0.51
     againſt
    0.50
     myſelf
    0.48
     paſſ
    0.47
     becauſe
    0.46
     ſtand
    0.45
     anſ
    0.45
    Act Density 0.216%

    No Known Activations