INDEX
    Explanations

    conjunctions that indicate connections and comparisons in sentences

    New Auto-Interp
    Negative Logits
     R
    -0.57
     I
    -0.56
    <eos>
    -0.55
     information
    -0.53
     -
    -0.52
     i
    -0.52
     N
    -0.52
     two
    -0.52
    ↵↵
    -0.52
     Ex
    -0.51
    POSITIVE LOGITS
    存于互联网档案馆
    0.89
     AssemblyCulture
    0.87
    ագրություններ
    0.81
     Мексичка
    0.81
    tvguidetime
    0.79
     незавершена
    0.78
    ształ
    0.77
    AnchorStyles
    0.75
     виправивши
    0.73
     كومونز
    0.72
    Act Density 0.322%

    No Known Activations