    Negative Logits
    Token                Logit
    (token not shown)    -0.49
    " enough"            -0.44
    "ex"                 -0.41
    (token not shown)    -0.41
    " is"                -0.41
    ","                  -0.40
    "↵↵"                 -0.40
    " I"                 -0.40
    "********"           -0.39
    " best"              -0.38
    Positive Logits
    Token                Logit
    " pleaſure"          0.66
    ":✨"                 0.61
    "Personensuche"      0.60
    (token not shown)    0.55
    " protoimpl"         0.53
    "PerformLayout"      0.53
    "Autoritní"          0.52
    "GraphicsUnit"       0.50
    "OGND"               0.50
    "wpi"                0.50
    Activation Density: 1.674%

    No Known Activations