INDEX
    Explanations

    references to specific cultural or geographic institutions and their statuses

    New Auto-Interp
    Negative Logits
    ussed
    -0.15
     monet
    -0.14
    моÑĢ
    -0.13
     portion
    -0.13
     descriptors
    -0.13
     Berm
    -0.13
    zion
    -0.13
    گاÙĩ
    -0.13
    duto
    -0.13
    ouve
    -0.13
    POSITIVE LOGITS
     en
    0.41
     die
    0.31
    die
    0.23
    Die
    0.23
    .en
    0.22
     DIE
    0.22
     Die
    0.21
     met
    0.21
     wa
    0.21
    ,en
    0.18
    Act Density 0.054%

    No Known Activations