INDEX
    Explanations
    No Explanations Found
    Head Attr Weights
    Head    Weight
    0       0.06
    1       0.06
    2       0.09
    3       0.07
    4       0.10
    5       0.07
    6       0.09
    7       0.10
    8       0.08
    9       0.07
    10      0.09
    11      0.08
    Negative Logits
    UGC             -2.29
    Pokémon         -1.81
     Transformers   -1.73
    Poké            -1.65
    DX              -1.60
     Feld           -1.59
    Ranked          -1.54
     XY             -1.53
     concessions    -1.53
    */              -1.49
    Positive Logits
    quila           1.75
    kefeller        1.58
    agall           1.53
    mony            1.52
    ongyang         1.45
    jon             1.43
    ullivan         1.42
    milo            1.41
    pired           1.41
    zman            1.40
    Act Density 0.000%

    No Known Activations

    This feature has no known activations.