    Explanations

    terms related to measurements of span or distance

    arm span to height ratio

    Negative Logits
    Token            Logit
    "y"              -0.43
    "<bos>"          -0.41
    " מוכ"           -0.40
    "it"             -0.39
    " Ju"            -0.38
    " jij"           -0.38
    "InjectMocks"    -0.38
    (blank)          -0.38
    (blank)          -0.37
    "ie"             -0.36
    Positive Logits
    Token            Logit
    " Span"          1.19
    "span"           1.16
    "Span"           1.14
    " SPAN"          1.12
    " span"          1.08
    "SPAN"           1.02
    "spans"          0.93
    "TextSpan"       0.85
    " spans"         0.85
    "spanning"       0.85
    Activation Density: 0.009%

    No Known Activations