INDEX
    Explanations
    New Auto-Interp
    Head Attr Weights
    0:0.12
    1:0.02
    2:0.04
    3:0.14
    4:0.05
    5:0.05
    6:0.17
    7:0.05
    8:0.03
    9:0.24
    10:0.03
    11:0.03
    Negative Logits
    MU
    -2.89
     Au
    -2.86
     Skinner
    -2.83
     Deer
    -2.81
     McH
    -2.81
     Martial
    -2.78
     Stall
    -2.78
     Caval
    -2.74
     MU
    -2.73
     caval
    -2.71
    POSITIVE LOGITS
     Times
    3.84
     NYT
    3.60
    ny
    3.33
    yss
    3.27
    times
    3.26
    Times
    3.20
     TIM
    3.03
    ias
    3.03
     Tas
    3.02
    tm
    2.88
    Act Density 0.002%

    No Known Activations