INDEX
    Explanations

    References to philosophical concepts or figures, particularly in relation to Objectivism and value systems

    Negative Logits
    elu        -0.15
    šku        -0.15
    mlink      -0.14
    ucene      -0.14
    价         -0.13
    ucci       -0.13
    ONO        -0.13
     Balance   -0.13
     Vladim    -0.13
    batis      -0.13
    Positive Logits
     Rand       0.21
    /rand       0.19
     Atlas      0.19
     peaceful   0.19
     Hay        0.18
    usta        0.18
    Atlas       0.17
     Roth       0.17
    Rand        0.17
     Ludwig     0.17
    Activation Density: 0.026%

    No Known Activations