INDEX
    Explanations

    mentions of domain-specific technical terms and model/product identifiers within instructional or explanatory text, especially when listing examples or specifications.

    New Auto-Interp
    Negative Logits
    0.33
    NANA
    0.32
    vykor
    0.30
     বলিয়াই
    0.29
    TARGETTING
    0.29
    OpportunitiesBy
    0.29
    सरकार
    0.29
    કિસ્
    0.29
     मुसलमान
    0.28
    0.28
    POSITIVE LOGITS
    ,
    0.47
     (
    0.42
     for
    0.37
     
    0.37
     à
    0.37
     For
    0.35
     A
    0.35
     to
    0.34
     +
    0.34
     de
    0.34
    Act Density 1.441%

    No Known Activations