INDEX
    Explanations

    core and key principles

    Negative Logits
    Token                 Value
    " key"                0.38
    "重大"                0.37
    " major"              0.36
    "潜力"                0.35
    "情况"                0.35
    " Possible"           0.33
    "状况"                0.33
    " possibles"          0.33
    "smöglichkeiten"      0.33
    " शुभकामनाएं"           0.32

    Positive Logits
    Token                 Value
    " tenets"             0.79
    " principle"          0.67
    " principles"         0.64
    "princ"               0.60
    " concepts"           0.60
    "concepts"            0.59
    " принци"             0.58
    " Concepts"           0.58
    " takeaways"          0.57
    "principles"          0.57
    Activation Density 0.275%

    No Known Activations