INDEX
    Explanations

    temporal phrases and questions regarding time

    New Auto-Interp
    Negative Logits
    ilo
    -0.15
    alis
    -0.14
    .nlm
    -0.14
    imo
    -0.14
    riel
    -0.13
    illis
    -0.13
    ryn
    -0.13
    SystemService
    -0.13
    mel
    -0.13
    anguage
    -0.13
    POSITIVE LOGITS
     we
    0.15
    umann
    0.14
    airy
    0.14
    816
    0.14
     they
    0.14
     he
    0.13
    ÙİØ³
    0.13
     metic
    0.13
     McCorm
    0.12
    λÏĮ
    0.12
    Act Density 0.224%

    No Known Activations