    Explanations

    References to the name "Bruce."

    Negative Logits
    Token            Logit
    " Starlight"     -0.73
    " meub"          -0.67
    " Nelly"         -0.66
    "раздо"          -0.66
    "coration"       -0.66
    " Haddad"        -0.64
    " Octavia"       -0.64
    " SEÑ"           -0.63
    "••••"           -0.63
    " logement"      -0.63
    Positive Logits
    Token            Logit
    " Bruce"          1.74
    "Bruce"           1.59
    " BRUCE"          1.38
    "bruce"           1.34
    " bruce"          1.31
    " Springsteen"    1.10
    "bru"             1.02
    "thâu"            0.89
    " Bru"            0.86
    " bru"            0.85
    Activation Density: 0.015%

    No Known Activations