INDEX
    Explanations

    mentions of specific names, particularly "Malkin" and "Morgan" in various contexts

    the name "Malkin" and its variants, indicating a focus on specific individuals

    New Auto-Interp
    Negative Logits
    ãĤ§
    -0.74
    ãĤ·ãĥ£
    -0.73
    BOOK
    -0.71
    âĸ¬âĸ¬
    -0.70
    cess
    -0.69
    PRES
    -0.68
    ciples
    -0.66
    ãĥį
    -0.66
    ffee
    -0.64
    BS
    -0.63
    POSITIVE LOGITS
     Malk
    1.19
    sburgh
    0.98
    ovich
    0.93
    atform
    0.82
    ules
    0.81
    mus
    0.80
    hin
    0.80
    rils
    0.80
    ers
    0.77
    ername
    0.76
    Act Density 0.017%

    No Known Activations