INDEX
    Explanations

    Australian locations/names

    New Auto-Interp
    Negative Logits
     mensaje
    -0.07
    くな
    -0.07
     başlan
    -0.07
     Raptors
    -0.06
     makeover
    -0.06
    	email
    -0.06
    anggal
    -0.06
     milyar
    -0.06
     zbyt
    -0.06
     üretim
    -0.06
    POSITIVE LOGITS
     Plumbing
    0.07
    [class
    0.07
     Topic
    0.07
     attr
    0.07
     illeg
    0.06
     Capitals
    0.06
     Tit
    0.06
     Decorating
    0.06
     Seattle
    0.06
     정치
    0.06
    Act Density 0.345%

    No Known Activations