INDEX
    Explanations

    references to professors

    repeated mentions of the title "Professor" followed by names

    New Auto-Interp
    Negative Logits
     destro
    -0.88
     queen
    -0.84
    ãĥ¼ãĥ³
    -0.79
     cruc
    -0.69
     fracture
    -0.69
     leash
    -0.68
     queens
    -0.68
     chorus
    -0.67
     takeoff
    -0.66
    burning
    -0.65
    POSITIVE LOGITS
    essors
    1.06
     Laure
    0.89
     Puzz
    0.89
    ĨĴ
    0.88
     Emer
    0.83
     emer
    0.82
     Michel
    0.79
     Professor
    0.79
    umin
    0.77
     Wolfgang
    0.77
    Act Density 0.021%

    No Known Activations