    Negative Logits
    " massively"    -0.07
    " corpus"       -0.07
    " levy"         -0.07
    " Levy"         -0.07
    " Nass"         -0.07
    " Moving"       -0.07
    " либо"         -0.07
    " Burr"         -0.07
    "SEA"           -0.06
    " Levi"         -0.06
    Positive Logits
    " friends"      0.13
    " friend"       0.12
    " Friend"       0.10
    " Friends"      0.10
    "Friends"       0.10
    "Friend"        0.09
    " vriend"       0.08
    " FRIEND"       0.08
    "OrDefault"     0.08
    "friends"       0.08
    Act Density 0.040%

    No Known Activations