INDEX
    Explanations

    mentions of specific individuals and their contributions in a professional context

    New Auto-Interp
    Negative Logits
    ../../../../
    -0.16
    raquo
    -0.15
    pper
    -0.15
    ibbon
    -0.14
    leur
    -0.14
    ãĥŁãĥ¥
    -0.14
    леменÑĤ
    -0.13
    verbatim
    -0.13
    IRA
    -0.13
    ech
    -0.13
    POSITIVE LOGITS
     u
    0.32
     reg
    0.26
     me
    0.25
     hä
    0.25
     meis
    0.23
     dane
    0.22
     au
    0.22
     auch
    0.21
     ua
    0.21
     z
    0.20
    Act Density 0.032%

    No Known Activations