INDEX
    Explanations

    instances of numerical or coded references

    New Auto-Interp
    Negative Logits
    708
    -0.16
    oyal
    -0.16
    nel
    -0.15
    彦
    -0.15
    609
    -0.14
    ites
    -0.14
    663
    -0.14
    ofi
    -0.14
    oyer
    -0.14
    167
    -0.14
    POSITIVE LOGITS
     Foot
    0.21
     foot
    0.18
     FOOT
    0.16
    foot
    0.16
    аниÑĨ
    0.15
    Foot
    0.15
    館
    0.15
    ÙĤاÙĦ
    0.15
    gua
    0.14
    leneck
    0.14
    Act Density 0.032%

    No Known Activations