INDEX
    Explanations
    No Explanations Found
    Head Attr Weights
    0:0.08
    1:0.08
    2:0.08
    3:0.08
    4:0.08
    5:0.07
    6:0.09
    7:0.08
    8:0.07
    9:0.08
    10:0.09
    11:0.08
    Negative Logits
    " SAT"     -3.58
    "onge"     -3.24
    " Wald"    -3.07
    "jong"     -2.96
    " SUN"     -2.88
    " NK"      -2.83
    " Nord"    -2.81
    " Niet"    -2.78
    "Cong"     -2.74
    (blank)    -2.73
    Positive Logits
    " Harris"  2.71
    " Amir"    2.69
    " Cyborg"  2.56
    "Harris"   2.52
    "/>"       2.46
    "DC"       2.45
    "Os"       2.38
    "ush"      2.38
    " Floyd"   2.31
    " Hogan"   2.30
    Act Density 0.000%

    This feature has no known activations.