INDEX
    Explanations

    the names of aircraft models

    New Auto-Interp
    Negative Logits
    enegger
    -0.78
    shaw
    -0.74
     humble
    -0.66
    baugh
    -0.63
     sidx
    -0.63
     cause
    -0.62
     gru
    -0.62
     Notting
    -0.62
    advertisement
    -0.62
    ebted
    -0.61
    POSITIVE LOGITS
    DF
    1.18
    TP
    1.15
    LC
    1.11
    TC
    1.11
    RP
    1.10
    OD
    1.09
    NP
    1.09
    ND
    1.06
    AC
    1.06
    DS
    1.06
    Act Density 0.069%

    No Known Activations