    NEGATIVE LOGITS
    ".scrollView"      -0.07
    " Aussie"          -0.07
    " ballots"         -0.07
    " sensitive"       -0.06
    " contaminants"    -0.06
    "annah"            -0.06
    "キー"             -0.06
    "oke"              -0.06
    " researchers"     -0.06
    " nib"             -0.06
    POSITIVE LOGITS
    " glor"            0.11
    " glamour"         0.09
    " Glam"            0.08
    " fantas"          0.08
    " glamorous"       0.07
    "dimension"        0.06
    ".newArrayList"    0.06
    "‌های"             0.06
    "gl"               0.06
    " romant"          0.06
    Activation Density: 0.003%

    No Known Activations