INDEX
    Explanations

    quantitative aspects and comparisons in various contexts

    New Auto-Interp
    Negative Logits
     nào
    -0.57
    newswire
    -0.57
    র্ব
    -0.56
     العنوان
    -0.56
    stalt
    -0.55
    awaiter
    -0.55
    Slightly
    -0.54
     Reverso
    -0.52
    پان
    -0.52
     거
    -0.52
    POSITIVE LOGITS
     many
    0.95
     MANY
    0.92
    MANY
    0.90
    many
    0.85
     Many
    0.83
     Viele
    0.81
     vieler
    0.79
     Banyak
    0.78
     viele
    0.78
    Many
    0.78
    Act Density 0.949%

    No Known Activations