    Negative Logits
    Token                 Logit
    " fertiliser"         -0.87
    "uality"              -0.82
    "tiness"              -0.80
    " архивлан"           -0.77
    "inery"               -0.76
    "BeginInit"           -0.76
    "OGND"                -0.76
    " tartalomajánló"     -0.75
    "ership"              -0.75
    "èdia"                -0.75
    Positive Logits
    Token                 Logit
    "a"                   0.48
    " status"             0.46
    " assolu"             0.45
    "e"                   0.42
    " berupa"             0.42
    " like"               0.41
    " present"            0.40
    "echt"                0.39
    "."                   0.38
    " driven"             0.37
    Activation Density: 0.233%

    No Known Activations