INDEX
    Explanations
    New Auto-Interp
    Head Attr Weights
    0:0.02
    1:0.01
    2:0.05
    3:0.08
    4:0.12
    5:0.02
    6:0.04
    7:0.40
    8:0.03
    9:0.03
    10:0.05
    11:0.08
    Negative Logits
    ailability
    -2.33
     tradem
    -2.04
    phabet
    -1.97
    apons
    -1.91
    psey
    -1.84
    querque
    -1.74
    ngth
    -1.72
    ompl
    -1.69
    chnology
    -1.68
    ascus
    -1.67
    POSITIVE LOGITS
     Lange
    1.61
     [&
    1.59
     jokes
    1.57
    Planet
    1.50
     Watkins
    1.46
     Doodle
    1.38
     McA
    1.37
     Unch
    1.36
     Simple
    1.33
     Constable
    1.33
    Act Density 0.006%

    No Known Activations