INDEX
    Explanations
    Head Attr Weights
    0:0.07
    1:0.07
    2:0.08
    3:0.08
    4:0.08
    5:0.08
    6:0.08
    7:0.09
    8:0.07
    9:0.08
    10:0.08
    11:0.09
    Negative Logits
    " Blooming"   -2.92
    " Cre"        -2.64
                  -2.62
    "abeth"       -2.61
    " Fountain"   -2.56
    " Erin"       -2.54
    " lav"        -2.48
                  -2.46
    "Textures"    -2.43
    " Hills"      -2.39
    Positive Logits
    " fuse"       2.75
    " Heller"     2.74
    "riots"       2.64
    "angers"      2.63
    " hazard"     2.52
    " strikers"   2.48
    "Nazis"       2.46
    "oxide"       2.46
    "aminer"      2.44
    " antioxid"   2.44
    Act Density 0.000%

    No Known Activations