INDEX
    Explanations

    quantities or counts that indicate a majority or ranking

    New Auto-Interp
    Negative Logits
     a
    -0.66
    -0.61
    -0.57
    いざ
    -0.54
     "
    -0.53
    -0.52
     '
    -0.52
    ↵↵
    -0.51
    ...
    -0.51
    a
    -0.50
    POSITIVE LOGITS
     myſelf
    1.46
     itſelf
    1.28
     Jefus
    1.28
     ―――――
    1.23
     Monfieur
    1.21
    تقاوى
    1.20
    MigrationBuilder
    1.18
     Chriftian
    1.17
     Majefty
    1.16
    +#+#
    1.16
    Act Density 0.149%

    No Known Activations