INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     creams
    -0.09
    veloped
    -0.08
     Magdalena
    -0.08
    plast
    -0.08
     dend
    -0.08
     Protestant
    -0.08
     renting
    -0.08
     Arrival
    -0.08
     Doctrine
    -0.08
     béton
    -0.08
    POSITIVE LOGITS
     trivia
    0.17
     Trivia
    0.16
    Trivia
    0.15
     quizzes
    0.13
    quiz
    0.12
    Quiz
    0.12
     quiz
    0.12
    竞猜
    0.12
     Quiz
    0.11
     questions
    0.11
    Act Density 0.013%

    No Known Activations