INDEX
    Explanations

    discussions around various political, social, and creative topics

    Negative Logits
    Token          Logit
    ".ActionBar"   -0.21
    "ABI"          -0.21
    "abb"          -0.20
    "aal"          -0.19
    "αλ"           -0.18
    " Alam"        -0.18
    "abbit"        -0.18
    "abies"        -0.18
    " Alb"         -0.18
    "aber"         -0.18
    Positive Logits
    Token          Logit
    " ihnen"       0.15
    " eux"         0.15
    "auss"         0.15
    " Họ"          0.14
    " them"        0.14
    "them"         0.14
    " họ"          0.14
    " they"        0.14
    " equally"     0.13
    "asio"         0.13
    Act Density 0.044%

    No Known Activations