INDEX
    Explanations
    No Explanations Found
    Head Attr Weights
    0:0.10
    1:0.07
    2:0.08
    3:0.08
    4:0.08
    5:0.09
    6:0.08
    7:0.08
    8:0.08
    9:0.06
    10:0.08
    11:0.07
    Negative Logits
    Token            Logit
    " Pik"           -2.60
    " earthquake"    -2.56
    " tsun"          -2.53
    "catentry"       -2.42
    " avalanche"     -2.38
    " iceberg"       -2.33
    " quake"         -2.32
    "angelo"         -2.31
    " Avatar"        -2.29
    " Covenant"      -2.28
    Positive Logits
    Token            Logit
    "Metro"          3.16
    "MET"            3.06
    "NRS"            2.90
    " TTC"           2.86
    "Mor"            2.86
    "dyl"            2.75
    ">>"             2.72
    "itars"          2.68
    "OY"             2.65
    "Fuck"           2.62
    Act Density 0.000%

    This feature has no known activations.