INDEX
    Explanations

    mentions of the word "rose" and its variations, indicating a focus on floral references

    New Auto-Interp
    Negative Logits
    ]))
    
    -0.61
    ])));
    -0.57
    ]";
    -0.57
    )";
    
    -0.56
     onCancelled
    -0.51
    ]));
    
    -0.50
    ]),
    
    -0.50
    )';
    -0.50
     verste
    -0.49
    -0.49
    POSITIVE LOGITS
     rose
    1.23
     ROSE
    0.96
    rose
    0.91
     Rising
    0.91
    ORIENT
    0.90
    orient
    0.90
    Rose
    0.87
     Rose
    0.86
     ORIENT
    0.83
     orient
    0.83
    Act Density 0.121%

    No Known Activations