INDEX
    Explanations

    elements of argumentation centered around logic and claims of evidence

    New Auto-Interp
    Negative Logits
     told
    -0.07
    اعت
    -0.07
     Hubbard
    -0.07
     phon
    -0.06
    ï¼Ĭ
    -0.05
     eventual
    -0.05
    event
    -0.05
     Orbit
    -0.05
    phon
    -0.05
    tractor
    -0.05
    POSITIVE LOGITS
    Argument
    0.08
    pollo
    0.08
     Argument
    0.08
     arguments
    0.07
    ../../../../
    0.07
    arguments
    0.07
     Arguments
    0.07
    ystate
    0.07
     argument
    0.07
    argument
    0.07
    Act Density 0.003%

    No Known Activations