INDEX
    Explanations

    references to the concept of "large" or "largeness" in various contexts

    New Auto-Interp
    Negative Logits
    se
    -0.16
    thing
    -0.15
    že
    -0.15
    ras
    -0.15
    batis
    -0.15
    467
    -0.14
     ราà¸Ħ
    -0.14
    ose
    -0.14
    ouro
    -0.14
     little
    -0.14
    POSITIVE LOGITS
    -scale
    0.54
    scale
    0.31
     scale
    0.31
     Scale
    0.28
    -format
    0.27
    Scale
    0.27
    cale
    0.25
    _scale
    0.24
     SCALE
    0.23
     enough
    0.23
    Act Density 0.046%

    No Known Activations