INDEX
    Explanations

    words related to swirling or spinning

    instances of the term "girl" in various contexts

    New Auto-Interp
    Negative Logits
    ħĭ
    -0.69
    QUIRE
    -0.62
     smart
    -0.61
     mature
    -0.60
     marrow
    -0.60
     punishing
    -0.59
     personalized
    -0.59
     imposing
    -0.59
     Forbidden
    -0.59
     demanding
    -0.58
    POSITIVE LOGITS
    irl
    1.04
    itudinal
    0.96
    onge
    0.89
    iot
    0.89
    iated
    0.88
    itude
    0.86
    iffe
    0.85
    ipedia
    0.85
    ados
    0.84
    oin
    0.84
    Act Density 0.008%

    No Known Activations