INDEX
    Explanations

    occurrences of the letter "B" in various contexts

    NEGATIVE LOGITS
    "inary"        -0.16
    "ween"         -0.16
    "errat"        -0.15
    " bilateral"   -0.14
    "rowse"        -0.14
    "opper"        -0.14
    "lest"         -0.14
    "орд"          -0.13
    " NEC"         -0.13
    "aba"          -0.13
    POSITIVE LOGITS
    "legen"        0.17
    "antz"         0.16
    "icks"         0.16
    "chop"         0.15
    "egen"         0.15
    "edio"         0.15
    "ipp"          0.15
    "ーバ"          0.15
    "ickle"        0.15
    " Rosenberg"   0.15
    Activation Density: 0.079%

    No Known Activations