INDEX
    Explanations

    political and legal terms

    New Auto-Interp
    Negative Logits
    Rules
    -0.85
    marks
    -0.83
    via
    -0.81
    Cho
    -0.78
    vine
    -0.76
    agree
    -0.74
    upon
    -0.74
    Enjoy
    -0.73
    flows
    -0.73
    Engine
    -0.73
    POSITIVE LOGITS
     sake
    1.42
     multitude
    1.16
     foreseeable
    1.15
     glimpse
    1.02
     variety
    1.01
     whopping
    1.00
     bunch
    0.99
     couple
    0.99
    ummies
    0.97
     handful
    0.97
    Act Density 6.668%

    No Known Activations