INDEX
    Explanations

    references to philosophers and philosophical concepts

    New Auto-Interp
    Negative Logits
    ¥¿
    -0.17
    oller
    -0.15
    iteli
    -0.14
    .AI
    -0.14
    ieu
    -0.14
    isVisible
    -0.14
    .fromFunction
    -0.14
    çĸij
    -0.14
    .tie
    -0.14
    áº
    -0.13
    POSITIVE LOGITS
    frauen
    0.14
     Hubb
    0.14
    alet
    0.14
     Educ
    0.14
    ayan
    0.14
     educators
    0.13
     Orc
    0.13
     exact
    0.13
    -*-
    0.13
    ropri
    0.13
    Act Density 0.012%

    No Known Activations