INDEX
    Explanations

    words related to breaking or exceeding limits or rules

    instances of the abbreviation "AF" or variations of "af."

    New Auto-Interp
    Negative Logits
     Fargo
    -0.72
     Patriarch
    -0.69
     Mothers
    -0.66
    DOWN
    -0.64
     disinfect
    -0.64
     Mour
    -0.63
    PASS
    -0.63
    MER
    -0.62
     expectancy
    -0.61
    offer
    -0.61
    POSITIVE LOGITS
    rican
    1.26
    rica
    1.16
    icion
    1.09
    ghan
    0.96
    riad
    0.96
    avorite
    0.95
    athom
    0.93
    riend
    0.90
    eties
    0.89
    eatures
    0.87
    Act Density 0.010%

    No Known Activations