INDEX
    Explanations

    references to critical theory and its applications

    New Auto-Interp
    Negative Logits
    éĻħ
    -0.14
    cribe
    -0.14
    ì°
    -0.14
    ιÏĩ
    -0.14
    kü
    -0.14
    erman
    -0.14
    ãģĸ
    -0.14
    ει
    -0.14
    erule
    -0.14
    elijk
    -0.14
    POSITIVE LOGITS
    ity
    0.35
     acclaim
    0.24
    ITY
    0.24
     jun
    0.19
    -path
    0.19
    ities
    0.18
     acclaimed
    0.18
    æĢ§çļĦ
    0.17
    s
    0.17
     mass
    0.17
    Act Density 0.015%

    No Known Activations