INDEX
    Explanations

    mentions of academic disciplines and fields of study, particularly focusing on philosophy

    occurrences of the word "philosophy" and related terms in the context of academic discussions

    Negative Logits
    "esty"        -0.81
    "女"          -0.79
    "ilee"        -0.78
    "elight"      -0.76
    "ells"        -0.74
    "ookie"       -0.73
    "ded"         -0.73
    "bring"       -0.71
    "ilant"       -0.70
    "kefeller"    -0.70
    Positive Logits
    "ophical"         1.34
    "ophers"          1.00
    "ophy"            0.92
    " philosopher"    0.89
    " philosophers"   0.89
    "opher"           0.87
    "otle"            0.85
    "ophe"            0.85
    "lectic"          0.82
    " philosophy"     0.80
    Activation Density: 0.034%

    No Known Activations