INDEX
    Explanations

    proper names and titles, particularly with 'von'

    instances of the name "von" in various contexts

    New Auto-Interp
    Negative Logits
    gallery
    -0.84
    taboola
    -0.78
    rentice
    -0.70
    orative
    -0.69
    inates
    -0.68
    uyomi
    -0.68
    ij士
    -0.67
    ा
    -0.67
    à¥
    -0.67
    inals
    -0.66
    POSITIVE LOGITS
     Braun
    0.85
     Frey
    0.85
    amins
    0.73
    env
    0.72
    hof
    0.72
     Doom
    0.69
     der
    0.69
     Karma
    0.68
    wald
    0.68
     Schwarz
    0.68
    Act Density 0.019%

    No Known Activations