INDEX
    Explanations

    concepts related to philosophy, knowledge, and social structures

    New Auto-Interp
    Negative Logits
    ÃŃl
    -0.16
    essen
    -0.15
    inez
    -0.15
    izont
    -0.15
    olon
    -0.15
    579
    -0.15
    osten
    -0.14
    ijľ
    -0.14
    pedia
    -0.14
    abet
    -0.14
    POSITIVE LOGITS
    raid
    0.16
     Convert
    0.14
     cryst
    0.14
     ÅĻÃŃj
    0.14
    bout
    0.14
     Anch
    0.13
     éħ
    0.13
    itel
    0.13
    lee
    0.13
    iesel
    0.13
    Act Density 1.177%

    No Known Activations