INDEX
    Explanations

    specific numerical values

    sequences of characters or patterns resembling "x" followed by numbers

    New Auto-Interp
    Negative Logits
     Hab
    -0.75
     breath
    -0.62
     deserted
    -0.61
     Nun
    -0.60
     washed
    -0.60
     balancing
    -0.59
     relied
    -0.59
     cultured
    -0.59
     forgiven
    -0.58
     Labrador
    -0.58
    POSITIVE LOGITS
    x
    3.85
    xes
    2.85
    xc
    2.29
    xb
    2.25
    xa
    2.23
    xd
    2.23
    xf
    2.23
    xff
    2.15
    xe
    2.15
    xs
    2.11
    Act Density 0.017%

    No Known Activations