INDEX
    Explanations
    Negative Logits
     Krypto        -0.92
     випад         -0.90
    为你            -0.88
     wunder        -0.86
    Ciclo          -0.84
     schu          -0.84
     viewType      -0.84
     mensch        -0.84
    vědět          -0.84
    discussed      -0.83
    Positive Logits
     editor         1.19
     shareholders   1.13
     адре           1.11
     Editor         1.08
     friends        1.07
     all            1.03
     sangat         1.03
     afectadas      1.02
     Dear           0.96
     congress       0.95
    Act Density 0.041%

    No Known Activations