INDEX
    Explanations

    terms related to discussions or debates

    terms related to discussions or debates

    New Auto-Interp
    Negative Logits
     dece
    -0.76
     foll
    -0.66
     bapt
    -0.65
     ren
    -0.62
     Hebrew
    -0.62
     gar
    -0.62
     mage
    -0.61
     est
    -0.60
     Gil
    -0.59
     dip
    -0.59
    POSITIVE LOGITS
    ussed
    3.94
    ussions
    3.26
    ussion
    2.82
    uss
    1.90
    USS
    1.49
    ourage
    1.28
    ussian
    1.20
    retion
    1.20
    ourse
    1.20
    overed
    1.15
    Act Density 0.027%

    No Known Activations