INDEX
    Explanations
    No Explanations Found
    New Auto-Interp
    Head Attr Weights
    0:0.05
    1:0.06
    2:0.08
    3:0.12
    4:0.07
    5:0.07
    6:0.11
    7:0.07
    8:0.07
    9:0.08
    10:0.09
    11:0.09
    Negative Logits
     Contact
    -1.46
     Inventory
    -1.44
     ______
    -1.40
    .")
    -1.35
    ergy
    -1.31
     Detail
    -1.28
     Furn
    -1.28
     Room
    -1.26
    "]
    -1.26
     Ammunition
    -1.26
    POSITIVE LOGITS
    1.79
    Sov
    1.65
     behavi
    1.50
     Indones
    1.47
    1.47
     surpr
    1.45
     Mulcair
    1.41
     McAuliffe
    1.41
    ModLoader
    1.39
     secondly
    1.39
    Act Density 0.000%

    No Known Activations

    This feature has no known activations.