INDEX
    Explanations
    Negative Logits
    "nesota"    -0.80
    "jad"       -0.77
    "phrine"    -0.74
    "cloth"     -0.74
    "LEASE"     -0.73
    "ctive"     -0.71
    "monds"     -0.71
    "hirt"      -0.70
    "casts"     -0.70
    "culus"     -0.68
    Positive Logits
    "arians"        0.89
    " Budapest"     0.89
    "Hung"          0.85
    "owitz"         0.84
    " Hungarian"    0.84
    "awei"          0.82
    " Hungary"      0.80
    "istani"        0.79
    "naires"        0.78
    " Viktor"       0.76
    Activation Density: 0.035%

    No Known Activations