INDEX
    Explanations

    repeated prepositional phrases indicating relationships or belonging

    Head Attr Weights
    0:0.03
    1:0.02
    2:0.07
    3:0.08
    4:0.17
    5:0.03
    6:0.02
    7:0.34
    8:0.03
    9:0.04
    10:0.06
    11:0.06
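
A minimal sketch for reading the head weights listed above. It assumes each entry is a `head index: weight` pair giving that attention head's share of the attribution; the page itself does not define the quantity. The names `head_attr_weights` and `rank_heads` are hypothetical and used only for illustration.

```python
# Head attribution weights copied from the listing above (head index -> weight).
# Assumption: each weight is that head's share of the total attribution.
head_attr_weights = {
    0: 0.03, 1: 0.02, 2: 0.07, 3: 0.08,
    4: 0.17, 5: 0.03, 6: 0.02, 7: 0.34,
    8: 0.03, 9: 0.04, 10: 0.06, 11: 0.06,
}


def rank_heads(weights):
    """Return (head, weight) pairs sorted by weight, largest first."""
    return sorted(weights.items(), key=lambda kv: kv[1], reverse=True)


ranked = rank_heads(head_attr_weights)
top_head, top_weight = ranked[0]

# The listed values are rounded to two decimals, so the total need not be exactly 1.
total = round(sum(head_attr_weights.values()), 2)

# Combined share of the two largest heads.
top_two_share = round(sum(w for _, w in ranked[:2]), 2)

print(f"top head: {top_head} ({top_weight})")
print(f"sum of listed weights: {total}")
print(f"share of top two heads: {top_two_share}")
```

Under that reading, head 7 carries the largest weight, followed by head 4, and together they account for roughly half of the listed total.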
    Negative Logits
    "uez"           -1.58
    "daq"           -1.57
    " Tsukuyomi"    -1.54
    "ebook"         -1.50
    "ebted"         -1.46
    " hypot"        -1.42
    "achus"         -1.37
    "quished"       -1.36
    " freely"       -1.34
    "wered"         -1.33
    Positive Logits
    "burgh"              1.33
    " toughness"         1.30
    " sophistication"    1.29
    " Hack"              1.28
    " Colour"            1.28
    " refinement"        1.28
    " Studio"            1.28
    " engagement"        1.26
    "rity"               1.25
    " professionalism"   1.23
    Act Density 0.001%

    No Known Activations