INDEX
    Explanations
    No Explanations Found
    Head Attr Weights
    0:0.07
    1:0.06
    2:0.10
    3:0.08
    4:0.09
    5:0.07
    6:0.09
    7:0.07
    8:0.08
    9:0.07
    10:0.08
    11:0.07
    Negative Logits
    "sonian": -1.92
    " generously": -1.64
    "crim": -1.62
    " freezer": -1.57
    "ˈ": -1.55
    "quartered": -1.55
    " flanked": -1.53
    " gorge": -1.52
    " lush": -1.49
    " proceeds": -1.49
    Positive Logits
    "Warren": 1.77
    " Tempest": 1.76
    " Remain": 1.75
    " Pearce": 1.75
    " Surviv": 1.75
    "Higher": 1.73
    "ה": 1.65
    " Oracle": 1.65
    " Judgment": 1.59
    "ר": 1.59
    Act Density 0.000%

    No Known Activations

    This feature has no known activations.