INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    beforeAll
    -0.45
    ==========
    -0.44
     fatti
    -0.44
    riwal
    -0.43
     Отечественной
    -0.42
     Usaha
    -0.42
    roquia
    -0.41
    getValues
    -0.41
     Weihnachten
    -0.40
    impresa
    -0.40
    POSITIVE LOGITS
     longer
    0.75
    longer
    0.70
    不再
    0.68
    жнему
    0.67
     Longer
    0.59
     enää
    0.58
     betweenstory
    0.56
    Longer
    0.55
    omore
    0.53
    rungsseite
    0.53
    Act Density 0.005%

    No Known Activations