INDEX
    Explanations

    references to limits, restrictions, or numerical thresholds, often related to policies or regulations

    references to limits or thresholds, particularly in a financial or regulatory context

    New Auto-Interp
    Negative Logits
    ĪĴ
    -0.87
    hower
    -0.74
    selves
    -0.73
    cause
    -0.73
    cipl
    -0.73
    isance
    -0.68
     Bread
    -0.66
    VD
    -0.66
    perse
    -0.65
     Roses
    -0.63
    POSITIVE LOGITS
    itol
    1.21
    aic
    0.93
    itals
    0.87
    illary
    0.86
    rison
    0.82
    stan
    0.78
    itated
    0.78
    aign
    0.76
    uchin
    0.75
    acious
    0.74
    Act Density 0.012%

    No Known Activations