INDEX
    Explanations

    mentions of numerical digits in various contexts

    New Auto-Interp
    Negative Logits
    u
    -0.54
    Rourke
    -0.53
    kuuta
    -0.52
     referrerpolicy
    -0.50
     Ragnarok
    -0.50
     aras
    -0.49
    a
    -0.49
    ūras
    -0.47
    太郎
    -0.47
    k
    -0.47
    POSITIVE LOGITS
     digits
    1.55
     digit
    1.38
    digits
    1.37
     Digit
    1.31
    digit
    1.19
     DIGIT
    1.19
    Digits
    1.18
    Digit
    1.16
    DIGIT
    1.11
     Efq
    1.05
    Act Density 0.002%

    No Known Activations