INDEX
    Explanations

    prominent proper nouns and legal terminology related to governance and citizenship

    New Auto-Interp
    Negative Logits
    irt
    -0.16
    .usage
    -0.15
    069
    -0.15
     USAGE
    -0.14
    USAGE
    -0.14
    QUIRE
    -0.14
     Hopkins
    -0.14
    izu
    -0.13
     Nice
    -0.13
    bidden
    -0.13
    POSITIVE LOGITS
    apon
    0.16
    ault
    0.14
    -toggle
    0.14
    abet
    0.14
    kan
    0.14
    alia
    0.13
    oder
    0.13
    lime
    0.13
    ervention
    0.13
    Äįan
    0.13
    Act Density 0.000%

    No Known Activations