INDEX
    Explanations

    references to historical figures and famous characters from various contexts

    references to historical figures, particularly philosophers and notable personalities

    Negative Logits
    Token      Logit
    "WN"       -0.86
    "CLA"      -0.77
    "LU"       -0.75
    "RAM"      -0.73
    "WF"       -0.73
    "VG"       -0.73
    "WI"       -0.70
    "leigh"    -0.70
    "CI"       -0.69
    "ve"       -0.68
    Positive Logits
    Token          Logit
    " Cthulhu"     0.96
    " Reincarn"    0.82
    "ocrates"      0.82
    " Lovecraft"   0.81
    "jriwal"       0.78
    " Napoleon"    0.75
    " Tsukuyomi"   0.75
    " fascinated"  0.71
    " Horus"       0.70
    "emort"        0.69
    Activation Density: 0.023%

    No Known Activations