INDEX
    Explanations

    Recht, Wettbewerb, Geschmack, Wohnung, Mittag, Gesundheit, Planung

    New Auto-Interp
    Negative Logits
    0.38
    UTONIUM
    0.38
     منظر
    0.38
    0.36
    0.36
    fromj
    0.36
     birdies
    0.35
     Seconds
    0.34
    leon
    0.34
    उँ
    0.34
    POSITIVE LOGITS
    ss
    1.30
    sv
    1.21
    sg
    1.20
    ssch
    1.15
    sprogram
    1.13
    spro
    1.12
    sw
    1.10
    ssystem
    1.10
    sthe
    1.08
    sm
    1.06
    Act Density 0.015%

    No Known Activations