INDEX
    Explanations

    references to quantities of people or items, particularly emphasizing small groups or minorities

    New Auto-Interp
    Negative Logits
    ãĥ¼ãĥĢ
    -0.15
     ÑģобоÑİ
    -0.14
    asca
    -0.14
    .subplots
    -0.13
    Ïĩεία
    -0.13
    دار
    -0.13
     BOTH
    -0.13
    amble
    -0.13
    utas
    -0.13
    omo
    -0.13
    POSITIVE LOGITS
     few
    1.02
    few
    0.88
     Few
    0.83
    Few
    0.79
     handful
    0.71
     quelques
    0.63
     fewer
    0.56
    å°ij
    0.56
     select
    0.50
     vÃłi
    0.49
    Act Density 0.341%

    No Known Activations