INDEX
    Explanations
    Negative Logits
    Token           Logit
     Ciebie         0.78
     culoare        0.77
     종류           0.76
     piensan        0.76
     berasal        0.74
     olmadığını     0.73
    ఖా              0.73
                    0.73
     कुनै            0.72
     پیسې           0.72
    Positive Logits
    Token           Logit
     efforts        1.12
     its            0.99
     advancements   0.99
     the            0.99
     attempts       0.93
     rampant        0.91
     widespread     0.89
     how            0.88
     disparities    0.86
     sự             0.85
    0.85
    Activation Density: 0.673%

    No Known Activations