INDEX
    Explanations

    phrases expressing necessity or deficiency

    New Auto-Interp
    Negative Logits
     myſelf
    -0.98
     itſelf
    -0.86
    ParallelGroup
    -0.80
    ſelf
    -0.78
     themſelves
    -0.77
     raiſ
    -0.75
    });*/
    -0.73
     againſt
    -0.72
    ſelves
    -0.72
     uſed
    -0.70
    POSITIVE LOGITS
     needing
    1.04
     lacking
    0.98
     Need
    0.97
    0.95
    Need
    0.94
     Lack
    0.91
     lack
    0.91
     need
    0.90
     needed
    0.89
    缺少
    0.88
    Act Density 0.245%

    No Known Activations