INDEX
    Explanations

    the occurrence of the word "deux" and its variations, indicating a focus on the concept of "two"

    New Auto-Interp
    Negative Logits
     Landschaft
    -0.50
     caminhada
    -0.48
    mathcal
    -0.48
    PullParser
    -0.46
     Anleitung
    -0.45
    zzleHttp
    -0.45
    baliknya
    -0.44
    การณ์
    -0.43
     beraber
    -0.43
     jaką
    -0.43
    POSITIVE LOGITS
     two
    1.04
     TWO
    0.96
     Two
    0.92
    Two
    0.87
    two
    0.84
     zwei
    0.84
     Два
    0.82
     два
    0.81
    Два
    0.81
    TWO
    0.81
    Act Density 0.001%

    No Known Activations