INDEX
    Explanations

    phrases related to recommendations or suggestions

    Head Attr Weights
    0:0.07
    1:0.10
    2:0.05
    3:0.06
    4:0.02
    5:0.10
    6:0.08
    7:0.09
    8:0.06
    9:0.09
    10:0.16
    11:0.07
    Negative Logits
    "olitics"        -1.65
    "ghazi"          -1.32
    "ylum"           -1.24
    " Hussein"       -1.23
    " mosques"       -1.22
    " sectarian"     -1.20
    "politics"       -1.20
    "Muslims"        -1.19
    "judicial"       -1.19
    -1.17
    Positive Logits
    " usability"      1.42
    " backend"        1.34
    " optimization"   1.31
    " homebrew"       1.30
    " setup"          1.29
    "Setup"           1.29
    " testers"        1.28
    " reusable"       1.28
    "Availability"    1.27
    " plugin"         1.27
    Act Density 0.719%

    No Known Activations