INDEX
    Explanations
    New Auto-Interp
    Negative Logits
                                                                
    -0.08
    Marc
    -0.07
     pastors
    -0.06
     vztah
    -0.06
    winter
    -0.06
    otes
    -0.06
     predictors
    -0.06
    ratio
    -0.06
    .encrypt
    -0.06
    que
    -0.06
    POSITIVE LOGITS
     need
    0.12
     needed
    0.12
     needs
    0.09
     Need
    0.09
    need
    0.09
     required
    0.08
    Loaded
    0.08
     Needed
    0.08
    needed
    0.07
     NEED
    0.07
    Act Density 0.099%

    No Known Activations