INDEX
    Explanations
    NEGATIVE LOGITS
    " Brno"        -0.07
    " aura"        -0.07
    "lop"          -0.07
    " snaží"       -0.07
    "Down"         -0.07
    " equality"    -0.06
    " Malone"      -0.06
    "-up"          -0.06
    "LEASE"        -0.06
    " modeled"     -0.06
    POSITIVE LOGITS
    " refriger"        0.16
    " Refriger"        0.12
    "riger"            0.09
    ".setMax"          0.07
    " reassuring"      0.07
    " refrigerator"    0.07
    " Brig"            0.07
    " adopts"          0.06
    "ارج"              0.06
    " حج"              0.06
    Act Density 0.001%

    No Known Activations