INDEX
    Explanations

    mentions of significant events or changes over time

    happening or changing since

    New Auto-Interp
    Negative Logits
    <_>
    -0.49
    iblichen
    -0.43
     Privacidad
    -0.40
     biçim
    -0.37
    Trả
    -0.37
    يكب
    -0.37
     نهایت
    -0.36
     إحدى
    -0.35
    UrlResolution
    -0.35
     către
    -0.35
    POSITIVE LOGITS
     since
    0.82
    以来
    0.77
     depuis
    0.71
    since
    0.68
     sejak
    0.68
    depuis
    0.68
    自从
    0.66
    Since
    0.65
    以來
    0.64
     Since
    0.63
    Act Density 0.034%

    No Known Activations