INDEX
    Explanations

    numerical values, particularly measurements and dimensions

    Negative Logits
    Token       Logit
    rus         -0.15
    iversit     -0.15
    icipant     -0.14
    erable      -0.14
    LOB         -0.14
    eno         -0.14
    ahoma       -0.14
    antino      -0.13
    iris        -0.13
    ogne        -0.13
    Positive Logits
    Token       Logit
    377         0.16
    642         0.14
    389         0.14
    357         0.14
    uta         0.14
    otech       0.14
    ħ§          0.13
    laz         0.13
    имв         0.13
    loh         0.13
    Activation Density: 0.082%

    No Known Activations