INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     instagram
    -0.11
    点赞
    -0.10
     Instagram
    -0.10
     indie
    -0.10
     cannabino
    -0.10
    Instagram
    -0.10
     emoji
    -0.09
     Playground
    -0.09
    instagram
    -0.09
     nerve
    -0.09
    POSITIVE LOGITS
     COB
    0.10
     ANSI
    0.09
     IBM
    0.09
    IBM
    0.09
     VBA
    0.09
     DOE
    0.09
     Andersen
    0.09
     memorandum
    0.09
     Motorola
    0.09
     Delphi
    0.09
    Act Density 0.015%

    No Known Activations