INDEX
    Explanations
    No Explanations Found
    New Auto-Interp
    Negative Logits
    .
    -0.37
    (
    -0.36
    \
    -0.35
     generation
    -0.34
     laboratories
    -0.33
     pores
    -0.33
     infringe
    -0.33
     head
    -0.32
    Read
    -0.32
     screen
    -0.32
    POSITIVE LOGITS
     ostavi
    0.85
     transfieras
    0.83
     Roskov
    0.80
    Personendaten
    0.79
    الحياه
    0.78
     typelib
    0.77
    Geplaatst
    0.76
    abestanden
    0.76
    0.75
     >=",
    0.74
    Act Density 0.006%

    No Known Activations