INDEX
    Explanations

    references to iron

    New Auto-Interp
    Negative Logits
     Guilford
    -0.90
    ParallelGroup
    -0.84
     Predators
    -0.77
     Schulte
    -0.75
     Cuth
    -0.74
     Lilith
    -0.73
     Stodd
    -0.73
     Navarre
    -0.72
     navideñas
    -0.71
    にほんブログ村
    -0.69
    POSITIVE LOGITS
     iron
    1.36
     Iron
    1.33
    Iron
    1.30
     IRON
    1.09
    iron
    1.05
    IRON
    0.92
     irons
    0.88
     Irons
    0.82
     */
    
    0.81
    0.79
    Act Density 0.008%

    No Known Activations