INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     gales
    -0.82
     avoient
    -0.80
     masts
    -0.79
     Chaldean
    -0.79
     catastrophes
    -0.78
     Sarm
    -0.75
     étoient
    -0.75
     mace
    -0.74
     hurricanes
    -0.73
     whiteboard
    -0.72
    POSITIVE LOGITS
    Демографія
    0.69
     henvisninger
    0.65
     der
    0.64
     and
    0.60
     for
    0.56
     ins
    0.56
     di
    0.56
     Ho
    0.56
     K
    0.55
     per
    0.55
    Act Density 0.126%

    No Known Activations