INDEX
    Explanations

    mentions of the French language and related cultural references

    New Auto-Interp
    Negative Logits
    extAlignment
    -0.93
    Hauptartikel
    -0.91
     Mij
    -0.86
     BBM
    -0.84
    wiada
    -0.79
    зыва
    -0.79
    awtextra
    -0.79
     Shetterly
    -0.79
    лерея
    -0.78
    RectangleBorder
    -0.77
    POSITIVE LOGITS
     French
    1.18
     France
    1.16
     francesa
    1.06
     française
    1.04
     França
    1.04
    France
    1.03
     francés
    1.02
    FRANCE
    0.99
     français
    0.97
    French
    0.96
    Act Density 0.050%

    No Known Activations