INDEX
    Explanations

    references to authority figures and hierarchical structures in a narrative context

    Negative Logits
     AssemblyCulture    -0.66
     NavController      -0.56
     RouterModule       -0.53
    /**                 -0.52
    ModelSerializer     -0.51
    matchCondition      -0.51
    XmlAccessType       -0.47
     ſever              -0.47
    ✨:                  -0.47
    tagext              -0.47
    Positive Logits
     tryna              0.39
    back                0.38
     gotta              0.37
    dam                 0.36
     Flach              0.36
     acostumb           0.35
     hitter             0.34
    xion                0.34
    Nobody              0.34
     like               0.33
    Activation Density: 0.077%

    No Known Activations