INDEX
    Explanations

    terms related to interaction and physical contact

    New Auto-Interp
    Negative Logits
    </b>
    -0.51
    -0.48
     st
    -0.48
    ,
    -0.47
    </i>
    -0.44
    -0.43
    '
    -0.43
    ↵↵
    -0.42
    <eos>
    -0.41
    [
    -0.41
    POSITIVE LOGITS
     Efq
    1.27
    BibitemShut
    1.25
     myſelf
    1.21
     Jefus
    1.17
     للاسماء
    1.16
    bibfield
    1.15
     houſe
    1.15
    bibinfo
    1.09
     raiſ
    1.09
     Theſe
    1.07
    Act Density 0.598%

    No Known Activations