INDEX
    Explanations
    New Auto-Interp
    Head Attr Weights
    0:0.08
    1:0.07
    2:0.09
    3:0.09
    4:0.08
    5:0.08
    6:0.09
    7:0.07
    8:0.08
    9:0.07
    10:0.09
    11:0.07
    Negative Logits
     Barber
    -2.75
     handshake
    -2.67
     Greenberg
    -2.56
     booths
    -2.47
     arms
    -2.40
     pumps
    -2.38
     signage
    -2.36
     haircut
    -2.33
     promoters
    -2.33
     sue
    -2.31
    POSITIVE LOGITS
    3.42
    3.13
    ��
    3.07
    3.06
    hent
    2.82
    enne
    2.79
     Rohing
    2.70
    dimension
    2.69
    upload
    2.69
    translation
    2.69
    Act Density 0.000%

    No Known Activations