INDEX
    Explanations

    mentions of language and its various forms

    New Auto-Interp
    Negative Logits
    ücks
    -0.71
    TypeDef
    -0.58
    -0.57
     greateſt
    -0.56
    ſelves
    -0.55
    ſelf
    -0.55
     occafion
    -0.55
     reaſon
    -0.54
     preſent
    -0.54
     microm
    -0.54
    POSITIVE LOGITS
     Lang
    0.89
     LANG
    0.87
     Language
    0.78
    Languages
    0.73
     languages
    0.73
     Langu
    0.73
    Lang
    0.71
     язы
    0.71
     Languages
    0.70
    auge
    0.70
    Act Density 0.106%

    No Known Activations