    Explanations

    discussions around legal and social equality, particularly focusing on distinctions between different groups or classes of individuals

    Negative Logits
    Token             Logit
    " сожалению"      -0.55
    "ToTensor"        -0.49
    " :("             -0.48
    " always"         -0.45
    "haltens"         -0.45
    " slecht"         -0.45
    " microfibra"     -0.44
    "انجليز"          -0.44
    " (;;"            -0.43
    " unfortunately"  -0.43
    Positive Logits
    Token             Logit
    " restrictions"   1.04
    " restriction"    1.02
    " Restrictions"   0.95
    " barriers"       0.93
    " boundaries"     0.92
    " Restriction"    0.88
    " barrier"        0.85
    " limitation"     0.84
    "restrictions"    0.84
    " limitations"    0.83
    Act Density 0.350%

    No Known Activations