    Explanations

    elements related to interpersonal relationships and character dynamics

    Negative Logits
     myſelf         -0.58
     ligiloj        -0.55
    scribers        -0.55
     auffi          -0.51
    ſelf            -0.50
    يميديا          -0.50
    Parcelize       -0.50
     fubject        -0.50
     zoude          -0.49
     ilustracja     -0.48
    Positive Logits
     freaking       0.48
     fucking        0.46
     tbh            0.46
     freakin        0.45
     FUCKING        0.44
     Fisch          0.44
     humanity       0.42
     Figue          0.42
     fuckin         0.42
     🤷             0.42
    Activation Density: 0.068%

    No Known Activations