INDEX
    Explanations

    occurrences of the letter 'b' in various contexts

    New Auto-Interp
    Negative Logits
    uyết
    -0.16
    illard
    -0.15
    ffect
    -0.14
    usterity
    -0.14
     ÑĢайонÑĥ
    -0.14
     skoro
    -0.14
    kola
    -0.13
    /type
    -0.13
    utow
    -0.13
    arbon
    -0.13
    POSITIVE LOGITS
    imed
    0.17
    INES
    0.16
    ted
    0.16
    iven
    0.16
    inen
    0.15
    ok
    0.14
     framework
    0.14
    itta
    0.14
    ined
    0.14
    ast
    0.14
    Act Density 0.037%

    No Known Activations