INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    GeoNames
    -0.66
    %;
    
    -0.60
     has
    -0.56
     refers
    -0.56
    はじめに
    -0.55
    ');
    
    -0.53
    Aktualisiert
    -0.53
    ніципалі
    -0.53
     includes
    -0.53
     may
    -0.51
    POSITIVE LOGITS
    autonomie
    0.57
    adpleegd
    0.55
     Such
    0.55
     therefrom
    0.54
     astfel
    0.54
     graças
    0.54
     andererseits
    0.52
    complexContent
    0.51
    ennemi
    0.50
     enfans
    0.50
    Act Density 0.369%

    No Known Activations