INDEX
    Explanations

    phrases related to identity and authority, particularly in the context of news reporting and personal accounts

    New Auto-Interp
    Head Attr Weights
    0:0.10
    1:0.02
    2:0.26
    3:0.15
    4:0.04
    5:0.06
    6:0.03
    7:0.05
    8:0.08
    9:0.02
    10:0.08
    11:0.05
    Negative Logits
     planners
    -3.01
     architects
    -2.74
     productivity
    -2.74
     mobility
    -2.67
     treadmill
    -2.63
     optimal
    -2.57
     glide
    -2.57
     innovations
    -2.47
     adaptive
    -2.45
     liv
    -2.40
    POSITIVE LOGITS
     apologised
    3.61
     TMZ
    3.57
     allegations
    3.57
     allegation
    3.54
     allege
    3.43
     suspicions
    3.42
     alleges
    3.40
     accusation
    3.39
     accusing
    3.36
     slander
    3.35
    Act Density 0.713%

    No Known Activations