INDEX
    Explanations
    Head Attr Weights
    0:0.08
    1:0.08
    2:0.08
    3:0.08
    4:0.08
    5:0.07
    6:0.09
    7:0.08
    8:0.07
    9:0.09
    10:0.07
    11:0.08
    Negative Logits
    "~~"           -2.98
    " Hugo"        -2.63
    " clinch"      -2.58
    " Sovere"      -2.47
    " Wim"         -2.42
    " Extension"   -2.40
    " Kim"         -2.39
    " til"         -2.36
    "uther"        -2.34
    " Sovereign"   -2.32
    Positive Logits
    "olith"        2.94
    "aji"          2.83
    "Benz"         2.67
    "Versions"     2.60
    "zai"          2.58
    "agonist"      2.58
                   2.56
    " Alto"        2.53
    "appa"         2.49
    " Cree"        2.42
    Act Density 0.000%

    No Known Activations