INDEX
    Explanations

    Informational writing

    Negative Logits
    Lab             -0.08
     Mood           -0.08
     histor         -0.08
    _DIALOG         -0.08
     VH             -0.08
     gubern         -0.07
     matrimonial    -0.07
                    -0.07
     Lab            -0.07
    History         -0.07
    Positive Logits
     KP             0.08
    ,也是            0.08
     abst           0.08
     букв           0.08
     سورة           0.08
                    0.08
     GBP            0.08
    (weights        0.08
     누구            0.07
     Anytime        0.07
    Act Density 0.050%

    No Known Activations