INDEX
    Explanations

    words related to opposition or resistance

    words related to assumptions or beliefs

    New Auto-Interp
    Negative Logits
    SEA
    -0.85
     Masters
    -0.83
    AMS
    -0.83
    GER
    -0.80
     Timber
    -0.74
    DER
    -0.73
     Abyssal
    -0.72
    Forest
    -0.72
     Schneider
    -0.72
     Rite
    -0.71
    POSITIVE LOGITS
     supp
    1.30
     scrut
    1.05
    lication
    0.99
    osition
    0.95
    ressive
    0.94
     pse
    0.92
    orter
    0.90
    roleum
    0.86
    uration
    0.85
     plaus
    0.84
    Act Density 0.008%

    No Known Activations