INDEX
    Explanations

    cooperation-related terms or phrases

    New Auto-Interp
    Negative Logits
    ãĥª
    -0.74
    arth
    -0.68
    nic
    -0.66
    mental
    -0.66
    fing
    -0.65
    brid
    -0.65
    bred
    -0.64
    marked
    -0.64
    season
    -0.59
    tha
    -0.58
    POSITIVE LOGITS
     favorably
    0.93
     extensively
    0.92
     positively
    0.89
     withd
    0.88
     peacefully
    0.88
     forcefully
    0.86
     negatively
    0.81
     violently
    0.81
     aggressively
    0.81
     vigorously
    0.80
    Act Density 11.368%

    No Known Activations