INDEX
    Explanations

    occurrences of the word "Lang," indicating a focus on language or linguistic references

    New Auto-Interp
    Negative Logits
    imler
    -0.15
    incoming
    -0.14
     Warren
    -0.13
    ç¿»
    -0.13
    ÅĽmy
    -0.13
    isto
    -0.13
     tard
    -0.13
    coat
    -0.13
    etur
    -0.13
     INTERRUPTION
    -0.13
    POSITIVE LOGITS
    еÑģÑı
    0.15
    -speaking
    0.15
    ÙĨÙĬÙĨ
    0.15
    stre
    0.15
    nan
    0.15
    enta
    0.15
    é̏
    0.14
    wich
    0.14
    lang
    0.14
    auge
    0.14
    Act Density 0.011%

    No Known Activations