INDEX
    Explanations

    instances of the letter 'r' in various contexts

    New Auto-Interp
    Negative Logits
    ÅĻet
    -0.16
    ots
    -0.15
    igated
    -0.14
    ulings
    -0.14
    bil
    -0.14
    озÑĸ
    -0.14
    bill
    -0.14
    DED
    -0.14
    acus
    -0.14
    ishops
    -0.13
    POSITIVE LOGITS
     r
    0.37
    =r
    0.22
    arer
    0.21
    )r
    0.21
    <r
    0.19
    r
    0.19
    ;r
    0.19
     ÑĢ
    0.18
    :r
    0.18
    (r
    0.17
    Act Density 0.021%

    No Known Activations