INDEX
    Explanations

    Arbitrary information

    New Auto-Interp
    Negative Logits
     turning
    -1.29
     turn
    -1.20
    Turning
    -1.01
     Turning
    -1.00
     turns
    -0.98
     turned
    -0.97
    turning
    -0.93
    turn
    -0.92
     Turn
    -0.79
    NewUrlParser
    -0.76
    POSITIVE LOGITS
     Ανακτήθηκε
    0.57
    telor
    0.52
    etan
    0.51
     alive
    0.48
    abestanden
    0.47
     appreciating
    0.47
    horabuena
    0.47
    MergeFrom
    0.46
    TTC
    0.46
    Rode
    0.46
    Act Density 0.035%

    No Known Activations