INDEX
    Explanations

    words related to spans or measurements of time or distance

    New Auto-Interp
    Negative Logits
    erm
    -0.16
    isted
    -0.15
    yas
    -0.15
    -ÑĤо
    -0.15
    wig
    -0.15
     spare
    -0.14
    ansson
    -0.14
    ives
    -0.14
    ystone
    -0.14
    praak
    -0.14
    POSITIVE LOGITS
    ned
    0.25
    ning
    0.25
    nable
    0.20
    iards
    0.18
     Span
    0.18
    /span
    0.17
    .Span
    0.16
     span
    0.16
    arta
    0.16
    berger
    0.16
    Act Density 0.015%

    No Known Activations