INDEX
    Explanations

    emotional expressions and interactions among characters

    New Auto-Interp
    Negative Logits
    agan
    -0.17
    ľ
    -0.16
     fart
    -0.16
    ogn
    -0.15
    emales
    -0.14
     Info
    -0.14
    759
    -0.14
     area
    -0.14
    èĩ£
    -0.14
     Britain
    -0.14
    POSITIVE LOGITS
     queer
    0.20
     callers
    0.16
    [P
    0.15
    etur
    0.15
    /GPL
    0.14
    æ´²
    0.14
     Pussy
    0.14
     Weed
    0.14
     Perc
    0.14
    -girl
    0.14
    Act Density 0.118%

    No Known Activations