INDEX
    Explanations

    concepts related to philosophy, religion, and political systems

    New Auto-Interp
    Negative Logits
    ":"/
    -0.78
    ._
    -0.62
    ady
    -0.62
    imester
    -0.62
     guiName
    -0.61
    worldly
    -0.60
    owe
    -0.58
    İ
    -0.57
    ":-
    -0.56
    uder
    -0.56
    POSITIVE LOGITS
     etc
    1.99
    etc
    1.69
     ect
    1.06
    whatever
    0.92
     et
    0.88
     and
    0.82
    ...)
    0.78
    â̦)
    0.75
     &
    0.75
    ĪĴ
    0.70
    Act Density 0.807%

    No Known Activations