    Explanations

    Mentions of specific years and dated time markers, especially recent calendar years and fiscal-year labels (e.g., FY-year).

    Negative Logits
    Token           Value
     sedative       1.39
     surgi          1.34
                    1.32
    🙉              1.31
     HAD            1.30
     quinine        1.26
     crippling      1.23
    אר              1.19
     astray         1.17
     THERE          1.15
    Positive Logits
    Token           Value
    ن               1.90
    ف               1.81
                    1.71
    T               1.69
    ל               1.52
    k               1.36
    h               1.33
    𝗔               1.30
    i               1.29
    د               1.27
    Activation Density: 0.020%

    No Known Activations