INDEX
    Explanations
    No Explanations Found
    New Auto-Interp
    Head Attr Weights
    0:0.08
    1:0.08
    2:0.08
    3:0.08
    4:0.08
    5:0.08
    6:0.09
    7:0.07
    8:0.08
    9:0.08
    10:0.08
    11:0.07
    Negative Logits
     Johann
    -3.04
     colony
    -3.03
     archaeological
    -2.95
     colonies
    -2.71
     galactic
    -2.70
     Xuan
    -2.69
     annex
    -2.69
     Ricardo
    -2.67
     archae
    -2.63
     plateau
    -2.57
    POSITIVE LOGITS
     Fired
    3.08
    Honest
    2.88
    milo
    2.81
    Style
    2.79
     Respons
    2.69
    Snake
    2.67
     Clicker
    2.65
    soType
    2.64
    Role
    2.61
    Weak
    2.54
    Act Density 0.000%

    No Known Activations

    This feature has no known activations.