INDEX
    Explanations
    NEGATIVE LOGITS
    Token                 Logit
    "OGND"                -1.03
    "Portale"             -0.68
    " disambiguazione"    -0.68
    "httphttps"           -0.65
    "tableFuture"         -0.65
    "లాలు"                -0.63
    " فريبيس"             -0.62
    " meisje"             -0.62
    "Климат"              -0.61
    " preghiera"          -0.61
    POSITIVE LOGITS
    Token                 Logit
    " ability"            0.55
    " propensity"         0.55
    " affinity"           0.54
    " connection"         0.53
    " sense"              0.53
    " presence"           0.51
    " will"               0.50
    " relation"           0.49
    " association"        0.49
    " visual"             0.49
    Activation Density: 0.005%

    No Known Activations