INDEX
    Explanations

    confirmation

    New Auto-Interp
    Negative Logits
     itſelf
    -0.94
     myſelf
    -0.92
     Jefus
    -0.88
     Majefty
    -0.87
     Theſe
    -0.86
     themſelves
    -0.84
     houſe
    -0.84
    hatenablog
    -0.84
    ſelf
    -0.83
    ſelves
    -0.83
    POSITIVE LOGITS
    Abitanti
    0.52
    parsedMessage
    0.49
    jep
    0.47
     dente
    0.45
    mybatisplus
    0.44
    ありません
    0.43
     jó
    0.42
     GP
    0.41
     afin
    0.41
    khart
    0.41
    Act Density 1.939%

    No Known Activations