INDEX
    Explanations

    statements and discussions related to arguments or claims made in a debate

    New Auto-Interp
    Negative Logits
     OSI
    -0.54
     Dina
    -0.50
     Whitby
    -0.46
    MAO
    -0.46
     methylene
    -0.45
    )•
    -0.45
     INA
    -0.45
    detectChanges
    -0.45
     ‹
    -0.45
    atibility
    -0.44
    POSITIVE LOGITS
     argument
    1.46
     arguments
    1.38
     argue
    1.36
    argument
    1.35
     argued
    1.29
    arguments
    1.27
    Argument
    1.27
     Argument
    1.27
     argues
    1.22
     arguing
    1.20
    Act Density 0.232%

    No Known Activations