INDEX
    Explanations
    No Explanations Found
    New Auto-Interp
    Head Attr Weights
    0:0.08
    1:0.09
    2:0.08
    3:0.08
    4:0.08
    5:0.09
    6:0.08
    7:0.07
    8:0.07
    9:0.07
    10:0.07
    11:0.08
    Negative Logits
     canvas
    -2.44
     colored
    -2.40
    ...
    -2.38
     Hawaiian
    -2.37
     HI
    -2.36
     reversible
    -2.28
    hod
    -2.24
    Fair
    -2.24
     descriptive
    -2.24
     Constantin
    -2.23
    POSITIVE LOGITS
    gob
    3.12
     gobl
    2.85
    GBT
    2.82
     Breitbart
    2.68
    FactoryReloaded
    2.67
     FB
    2.67
     UCH
    2.67
    mber
    2.64
    龍契士
    2.64
    tf
    2.57
    Act Density 0.000%

    No Known Activations

    This feature has no known activations.