INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     chapitre
    -0.08
     Chapter
    -0.08
    igail
    -0.07
    备注
    -0.07
    Chapter
    -0.07
    采取
    -0.07
    Slots
    -0.07
    -0.07
    重点
    -0.07
     рождения
    -0.07
    POSITIVE LOGITS
     aspectos
    0.08
     aspects
    0.08
     out
    0.08
     decorative
    0.08
     zaken
    0.08
     DETAIL
    0.08
     promoção
    0.08
     unpleasant
    0.07
     SCIP
    0.07
     SERVER
    0.07
    Act Density 0.003%

    No Known Activations