INDEX
    Explanations

    identifying definitions

    New Auto-Interp
    Negative Logits
    æĽ°
    -0.10
     Tribe
    -0.09
     tant
    -0.09
     vo
    -0.09
     corresponding
    -0.08
     temper
    -0.08
     symbolism
    -0.08
    olan
    -0.08
    748
    -0.08
     Kendrick
    -0.08
    POSITIVE LOGITS
     refers
    0.39
     refer
    0.38
     referring
    0.31
    ref
    0.29
    refer
    0.28
    æĮĩ
    0.27
     Ref
    0.24
     Refer
    0.23
    Refer
    0.23
    _refer
    0.21
    Act Density 0.214%

    No Known Activations