INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    ImageContext
    -0.67
     comp
    -0.50
     gim
    -0.50
    URLException
    -0.50
    edar
    -0.49
    vested
    -0.48
     experiments
    -0.48
     conduct
    -0.48
     cost
    -0.48
    MemoryWarning
    -0.48
    POSITIVE LOGITS
     بيها
    0.58
    +#+#
    0.54
    CloseOperation
    0.54
     Politica
    0.53
     échou
    0.52
    σθαι
    0.50
     arrêté
    0.50
    ChildScrollView
    0.49
    protoimpl
    0.49
    Frauen
    0.48
    Act Density 0.004%

    No Known Activations