INDEX
    Explanations

    themes related to social and cultural identity

    Negative Logits
    Token               Logit
    "loff"              -0.15
    "Visibility"        -0.15
    " Bram"             -0.14
    "ansk"              -0.14
    "getter"            -0.14
    " Carlson"          -0.14
    "tep"               -0.13
    "Ù쨳"               -0.13
    (whitespace)        -0.13
    "ullen"             -0.13
    Positive Logits
    Token               Logit
    " nature"           0.43
    " aspect"           0.39
    " element"          0.37
    "nature"            0.35
    " aspects"          0.33
    " angle"            0.32
    " component"        0.31
    " flavor"           0.31
    "aspect"            0.31
    " bent"             0.30
    Activation Density: 0.265%

    No Known Activations