INDEX
    Explanations

    occurrences of the French pronouns and their variations

    New Auto-Interp
    Negative Logits
    trash
    -0.16
    488
    -0.15
    738
    -0.15
    teness
    -0.15
    еÑĦ
    -0.14
    erts
    -0.14
    одо
    -0.14
    asser
    -0.14
    pus
    -0.13
    onso
    -0.13
    POSITIVE LOGITS
    ATAL
    0.15
    Ģ
    0.15
    íķĢ
    0.15
     wen
    0.14
    Ø´ÙĬ
    0.14
    мом
    0.14
    ENTS
    0.14
     PROCUREMENT
    0.14
    elden
    0.14
    ắng
    0.13
    Act Density 0.009%

    No Known Activations