INDEX
    Explanations

    articles (the, a)

    New Auto-Interp
    Negative Logits
    #![
    -0.51
     كومونز
    -0.50
    Your
    -0.46
    DoubleQuotes
    -0.46
    Our
    -0.46
    際の
    -0.46
    Those
    -0.43
     själva
    -0.43
    Наша
    -0.43
     Those
    -0.43
    POSITIVE LOGITS
     Future
    0.79
     Past
    0.71
     future
    0.67
    UnusedPrivate
    0.66
    Future
    0.66
    menuStrip
    0.66
     World
    0.65
    setVerticalGroup
    0.65
    future
    0.63
     Circle
    0.60
    Act Density 0.002%

    No Known Activations