INDEX
    Explanations
    Negative Logits
    " allergy"    -0.07
    "compet"      -0.07
    " punches"    -0.06
    "dbus"        -0.06
    " tight"      -0.06
    "Travel"      -0.06
    " neutral"    -0.06
    "tick"        -0.06
    " Поп"        -0.06
    "izziness"    -0.06
    Positive Logits
    " sole"                  0.11
    " Sole"                  0.10
    "sole"                   0.09
    " solvent"               0.06
    ".setAction"             0.06
    " leds"                  0.06
    "(savedInstanceState"    0.06
    " whole"                 0.06
    "shaled"                 0.06
    ".core"                  0.06
    Activation Density: 0.002%

    No Known Activations