INDEX
    Explanations

    mentions of the name "Ben" or similar variations

    Tokens followed by "im", "Sev", or "Her"

    ben followed by names or words

    New Auto-Interp
    Negative Logits
    multer
    -0.52
    extAlignment
    -0.51
    Fatalf
    -0.51
    <strong>
    -0.50
     StatelessWidget
    -0.47
    utuhan
    -0.46
     compa
    -0.44
    Organisateur
    -0.44
    ILON
    -0.44
    omere
    -0.44
    POSITIVE LOGITS
     BEN
    1.05
    BEN
    0.99
     Ben
    0.98
     ben
    0.97
    ben
    0.94
     pinulongan
    0.94
    Ben
    0.93
     Bens
    0.90
     Benav
    0.82
     benches
    0.82
    Act Density 0.102%

    No Known Activations