INDEX
    Explanations

    statements related to goals or purposes

    New Auto-Interp
    Negative Logits
     slips
    -0.65
     slides
    -0.63
    regular
    -0.63
    gypt
    -0.62
    ython
    -0.62
     equivalents
    -0.61
     stains
    -0.61
     clips
    -0.61
     emits
    -0.61
    haw
    -0.60
    POSITIVE LOGITS
     maximizing
    0.93
     revenge
    0.80
     ambitious
    0.79
     laud
    0.79
     fulfilled
    0.77
     restoring
    0.76
     vengeance
    0.76
     preservation
    0.73
     ensuring
    0.71
     preserving
    0.71
    Act Density 0.107%

    No Known Activations