INDEX
    Explanations
    New Auto-Interp
    Head Attr Weights
    0:0.03
    1:0.01
    2:0.12
    3:0.26
    4:0.13
    5:0.03
    6:0.05
    7:0.08
    8:0.03
    9:0.05
    10:0.11
    11:0.06
    Negative Logits
    20439
    -2.04
     theirs
    -1.65
     ratios
    -1.61
     discrepancies
    -1.58
    ubi
    -1.51
    ealous
    -1.51
    spread
    -1.48
    aretz
    -1.46
     Logged
    -1.45
    zinski
    -1.43
    POSITIVE LOGITS
    University
    1.77
     Exhibition
    1.70
     pav
    1.63
     Exploration
    1.55
     amph
    1.55
    Located
    1.45
     Methodist
    1.41
     Burke
    1.40
     pier
    1.38
    stairs
    1.38
    Act Density 0.010%

    No Known Activations