INDEX
    Explanations
    No Explanations Found
    Head Attr Weights
    0:0.07
    1:0.06
    2:0.08
    3:0.09
    4:0.08
    5:0.10
    6:0.08
    7:0.08
    8:0.08
    9:0.07
    10:0.07
    11:0.09
    Negative Logits
    " Brach"       -1.49
    " Abrams"      -1.49
    " intrig"      -1.48
    "orthy"        -1.48
    " indisc"      -1.43
    " advances"    -1.41
    " Brill"       -1.39
    "orius"        -1.38
    " spoiler"     -1.36
    " Iv"          -1.36
    Positive Logits
    "usa"          1.73
    "auri"         1.65
    "da"           1.58
    "Northern"     1.56
    "Phone"        1.53
    "س"            1.51
    "AIR"          1.50
    "wx"           1.47
    "gob"          1.47
    "trak"         1.46
    Act Density 0.000%

    No Known Activations

    This feature has no known activations.