INDEX
    Explanations

    the term "fan" in different contexts

    New Auto-Interp
    Negative Logits
    ===============
    -0.73
     gaun
    -0.65
     Nex
    -0.63
     kj
    -0.62
     Whittaker
    -0.61
    ///////////////
    -0.60
    שה
    -0.59
     fris
    -0.59
     bé
    -0.58
    herin
    -0.57
    POSITIVE LOGITS
     fan
    1.64
    Fan
    1.64
     fans
    1.59
     FAN
    1.59
     Fan
    1.55
     Fans
    1.55
    fan
    1.53
    FAN
    1.48
    fans
    1.46
    Fans
    1.45
    Act Density 0.013%

    No Known Activations