INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    Segue
    -0.10
     Innovative
    -0.10
     Ingen
    -0.09
    .ru
    -0.09
     Signature
    -0.09
     Zwe
    -0.09
    eco
    -0.08
     ded
    -0.08
     Reco
    -0.08
    BCM
    -0.08
    POSITIVE LOGITS
     revolution
    0.31
     Revolution
    0.28
    éĿ©åij½
    0.25
     rein
    0.22
     future
    0.21
    revolution
    0.21
     ÑĢеволÑİ
    0.19
     Rein
    0.19
    future
    0.18
    éĿ©
    0.17
    Act Density 0.183%

    No Known Activations