INDEX
    Explanations

    terms related to being influenced or guided by a specific driving force or motivation

    terms related to frameworks or systems that influence behavior or outcomes

    Negative Logits
     square      -0.85
     balls       -0.73
     conc        -0.68
     present     -0.66
     insign      -0.66
     squares     -0.65
     correct     -0.65
     nails       -0.65
     person      -0.64
     photograph  -0.64
    Positive Logits
    dominated     2.19
    heavy         2.10
    driven        2.10
    enabled       1.81
    focused       1.72
    turned        1.68
    intensive     1.65
    laden         1.63
    centered      1.62
    centric       1.55
    Act Density 0.035%

    No Known Activations