INDEX
    Explanations

    terms related to personal situations or experiences, including seeking help or explaining difficulties

    New Auto-Interp
    Negative Logits
    thood
    -0.78
    iaries
    -0.75
    luster
    -0.75
    ngth
    -0.70
    oso
    -0.68
    tackle
    -0.68
     CrossRef
    -0.67
    mage
    -0.67
    elight
    -0.65
    airs
    -0.64
    POSITIVE LOGITS
     rationale
    1.24
     reasoning
    1.08
     concepts
    0.94
     why
    0.94
     principles
    0.92
     virtues
    0.91
     criteria
    0.90
     workings
    0.89
    why
    0.89
     difference
    0.88
    Act Density 0.181%

    No Known Activations