INDEX
    Explanations

    references to the concept of time or duration

    New Auto-Interp
    Negative Logits
    ãĥ£
    -0.16
    aque
    -0.15
    аз
    -0.15
    ant
    -0.14
    rana
    -0.14
    anch
    -0.13
    γÏī
    -0.13
    rael
    -0.13
    iglia
    -0.13
     ever
    -0.13
    POSITIVE LOGITS
    -than
    0.19
     than
    0.18
    onta
    0.18
    than
    0.17
    lli
    0.16
    iban
    0.14
    ään
    0.14
    upe
    0.14
    ei
    0.14
    Roy
    0.14
    Act Density 0.034%

    No Known Activations