INDEX
    Explanations

    phrases related to legal or governmental terminology

    New Auto-Interp
    Negative Logits
    !”
    -1.24
    !’
    -1.12
    ”!
    -1.08
    ?”
    -1.01
    ,”
    -0.98
    ’?
    -0.98
    …”
    -0.94
     our
    -0.93
    ?’
    -0.92
    ”?
    -0.91
    POSITIVE LOGITS
    ).[
    0.96
    ,[
    0.88
    ‌است
    0.83
    .[
    0.82
    0.81
    0.68
     tuttavia
    0.66
    ٔ
    0.64
     wikipagina
    0.64
    0.64
    Act Density 2.224%

    No Known Activations