INDEX
    Explanations

    references to wealth and enriched experiences, often through consumerism or lifestyle choices

    New Auto-Interp
    Negative Logits
    )"),
    -0.85
    `;
    
    -0.81
    />";
    -0.78
    ]),
    
    -0.77
    ')],
    -0.77
    )}
    
    -0.73
    "]);
    
    -0.73
    ")}
    -0.73
    ArrowToggle
    -0.72
    "]];
    -0.72
    POSITIVE LOGITS
     lol
    0.93
     lmao
    0.92
     haha
    0.77
     fucking
    0.77
     LOL
    0.77
    ?!
    0.76
     goddamn
    0.74
     freakin
    0.72
    lol
    0.72
     lmfao
    0.71
    Act Density 0.529%

    No Known Activations