INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     communists
    0.44
     restantes
    0.44
    必要
    0.43
     actores
    0.43
    0.43
    0.43
     notables
    0.42
     comrades
    0.41
     fate
    0.41
    SB
    0.41
    POSITIVE LOGITS
    it
    0.45
    isent
    0.44
    irittura
    0.44
     경우는
    0.43
    esse
    0.42
    have
    0.42
    łki
    0.42
     Improved
    0.41
    0.41
    itetty
    0.41
    Act Density 0.000%

    No Known Activations