INDEX
    Explanations

    statements that emphasize the truth or correctness of a claim

    New Auto-Interp
    Negative Logits
     andato
    -0.55
     raggiunto
    -0.54
    bouncycastle
    -0.51
    ServiceTest
    -0.49
     losses
    -0.48
    -0.47
    ".$_
    -0.46
     usati
    -0.46
     στις
    -0.46
     usato
    -0.46
    POSITIVE LOGITS
     actually
    1.27
     indeed
    1.24
    indeed
    1.19
    actually
    1.16
    Actually
    1.14
     Actually
    1.13
     Indeed
    1.12
    Indeed
    1.11
     faktisk
    1.01
    etheless
    1.01
    Act Density 0.066%

    No Known Activations