INDEX
    Explanations

    Defamation and false statements

    New Auto-Interp
    Negative Logits
    Iteration
    -0.08
     нужен
    -0.07
     Sa
    -0.07
    -0.07
     Claudia
    -0.07
    MAD
    -0.07
     SAT
    -0.07
     ceramics
    -0.07
     iteration
    -0.07
     pic
    -0.07
    POSITIVE LOGITS
     defamatory
    0.13
     alleged
    0.10
     factual
    0.10
     Hoop
    0.10
     unjust
    0.09
    涉嫌
    0.09
     impair
    0.09
     prejud
    0.09
     तथ्य
    0.09
     unlaw
    0.09
    Act Density 0.008%

    No Known Activations