INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     a
    1.22
    to
    1.22
    z
    1.13
    a
    1.07
    the
    1.06
    in
    1.05
    i
    1.05
    txt
    1.02
     -
    0.98
    toare
    0.97
    POSITIVE LOGITS
     gauges
    1.46
    ри
    1.38
    '
    1.25
     gages
    1.23
    ι
    1.22
     gauge
    1.21
     gauging
    1.16
    1.15
    1.14
    1.09
    Act Density 0.001%

    No Known Activations