INDEX
    Explanations

    specific words in various languages (particularly Spanish and French) related to objects or items

    New Auto-Interp
    Negative Logits
    theless
    -0.92
     iſt
    -0.86
    Germain
    -0.78
     ―――――
    -0.76
     ་་
    -0.72
    jména
    -0.72
    ly
    -0.72
     ISTAT
    -0.70
     impar
    -0.70
     Mahomet
    -0.70
    POSITIVE LOGITS
    Σε
    1.08
     à
    1.07
     σε
    0.86
     zu
    0.85
    ]<<
    0.84
     BorderRadius
    0.84
    Về
    0.84
    andExpect
    0.83
    }}]{
    0.82
     the
    0.82
    Act Density 0.015%

    No Known Activations