INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     الرياضيه
    -0.81
    szt
    -0.70
     Darryl
    -0.66
     Dinamarca
    -0.65
    PhysRev
    -0.65
    ccini
    -0.64
     pewter
    -0.64
    nextLine
    -0.64
     Noruega
    -0.64
     InputDecoration
    -0.63
    POSITIVE LOGITS
     island
    2.47
     Island
    2.38
    Island
    2.23
     islands
    2.19
    island
    2.15
     ISLAND
    2.10
     Islands
    2.04
    Islands
    1.95
    islands
    1.90
     ISLANDS
    1.67
    Act Density 0.028%

    No Known Activations