INDEX
    Explanations

    German names or titles

    the occurrence of the name "von" in various contexts

    Negative Logits
    Token           Logit
    "ional"         -0.84
    " procedural"   -0.75
    "UGC"           -0.74
    "atari"         -0.71
    "taboola"       -0.67
    "Canadian"      -0.66
    "mable"         -0.66
    "wives"         -0.66
    "orph"          -0.64
    "ointment"      -0.63

    Positive Logits
    Token           Logit
    " Braun"        1.11
    "neg"           0.90
    " der"          0.88
    " Syd"          0.87
    " Frey"         0.87
    " Kra"          0.85
    " Wer"          0.83
    " Doom"         0.82
    " Stru"         0.82
    " Schwarz"      0.81

    Activation Density: 0.044%

    No Known Activations