INDEX
    Explanations

    mentions of bicycles

    New Auto-Interp
    Negative Logits
    arial
    -1.03
    arios
    -0.94
    nyder
    -0.82
    ips
    -0.78
    ests
    -0.77
    orial
    -0.77
    oys
    -0.76
    essee
    -0.76
    itia
    -0.76
    esting
    -0.76
    POSITIVE LOGITS
     bicycle
    1.00
    puter
    0.90
     Bicycle
    0.84
     erg
    0.83
    ©¶æ¥µ
    0.79
     bicycles
    0.77
     bicy
    0.76
     bicycl
    0.74
    ©¶æ
    0.74
     Friendly
    0.73
    Act Density 0.004%

    No Known Activations