INDEX
    Explanations
    New Auto-Interp
    Head Attr Weights
    0:0.07
    1:0.08
    2:0.09
    3:0.08
    4:0.07
    5:0.09
    6:0.09
    7:0.07
    8:0.08
    9:0.07
    10:0.08
    11:0.07
    Negative Logits
     Gallagher
    -2.77
     Decker
    -2.48
    itol
    -2.33
    Gall
    -2.31
     Gonz
    -2.30
     Cheese
    -2.28
    aste
    -2.23
     Roma
    -2.22
     Redskins
    -2.21
     Bundy
    -2.21
    POSITIVE LOGITS
    覚醒
    3.40
    sung
    3.20
    2.81
    ombies
    2.74
     裏�
    2.63
    */(
    2.50
    ##
    2.47
    ynes
    2.46
    %%%%
    2.45
    2.41
    Act Density 0.000%

    No Known Activations