INDEX
    Explanations

    structured lists and descriptions of collective entities or concepts

    New Auto-Interp
    Negative Logits
     both
    -0.26
     Both
    -0.22
    Both
    -0.22
    both
    -0.22
     BOTH
    -0.20
     beide
    -0.20
     third
    -0.17
    tring
    -0.17
    _both
    -0.16
    両
    -0.16
    POSITIVE LOGITS
     four
    0.56
     five
    0.52
    five
    0.46
     six
    0.45
     seven
    0.43
    four
    0.41
     bá»ijn
    0.41
     eight
    0.40
     ÑĩеÑĤÑĭ
    0.39
     cuatro
    0.38
    Act Density 0.184%

    No Known Activations