INDEX
    Explanations

    specific historical dates and events

    New Auto-Interp
    Negative Logits
    lopen
    -0.16
    ause
    -0.15
    hev
    -0.15
    ic
    -0.15
    OfClass
    -0.14
    apot
    -0.14
    169
    -0.14
    dol
    -0.14
    anel
    -0.14
    icles
    -0.14
    POSITIVE LOGITS
     ÙħÛĮÙĦادÛĮ
    0.28
    ëħĦ
    0.22
     edition
    0.22
    edition
    0.21
    å¹´
    0.20
     vintage
    0.19
    -present
    0.17
    Ø¡
    0.17
     годÑĥ
    0.17
    \.
    0.16
    Act Density 0.442%

    No Known Activations