INDEX
    Explanations
    Negative Logits
    Token             Logit
    " للمعارف"        -0.84
    "berdayakan"      -0.77
    " property"       -0.77
    "ineno"           -0.76
    " Property"       -0.74
    " Италијани"      -0.74
    " समीक्षक"        -0.72
    "Portale"         -0.71
    " properties"     -0.71
    "Property"        -0.69
    Positive Logits
    Token             Logit
    " ("              0.41
    " man"            0.41
    " Gleich"         0.41
    " Confusion"      0.41
    " Educação"       0.39
    " giy"            0.38
    " graduation"     0.38
    " injured"        0.37
    " lieben"         0.37
    "abstractmethod"  0.37
    Activation Density: 0.087%

    No Known Activations