INDEX
    Explanations

    instances of the word "one" or its variants in different contexts

    Negative Logits
    " fumée"          -0.85
    " suivantes"      -0.79
    "Personendaten"   -0.77
    " économie"       -0.74
    " pérd"           -0.72
    " myſelf"         -0.70
    "ásban"           -0.69
    " Sociales"       -0.69
    " armée"          -0.69
    " houſe"          -0.69
    Positive Logits
    " a"              1.18
    " एक"             1.03
    "एक"              1.00
    " een"            0.98
    " một"            0.98
    " یک"             0.95
    "Een"             0.94
    " einem"          0.94
    " Eine"           0.93
    "Eine"            0.93
    Activation Density: 0.021%

    No Known Activations