INDEX
    Explanations

    words related to physical features or characteristics

    prefixes and root forms of words that indicate various actions or conditions

    New Auto-Interp
    Negative Logits
    Ô
    -0.89
    å§«
    -0.75
     Limited
    -0.70
     Exper
    -0.70
     Commit
    -0.68
    terday
    -0.67
     Hearts
    -0.67
    à¼
    -0.66
     Disorder
    -0.66
     Challenges
    -0.65
    POSITIVE LOGITS
    cients
    1.15
    osphere
    1.14
    iest
    1.05
    isher
    0.95
    iph
    0.95
    encer
    0.94
    elfth
    0.91
    icer
    0.91
    asher
    0.91
    abase
    0.89
    Act Density 0.286%

    No Known Activations