    Explanations

    references to flying or aviation-related activities

    Negative Logits
    ".localization"   -0.17
    "osten"           -0.15
    "ectl"            -0.15
    "ama"             -0.15
    ".Apis"           -0.15
    "poon"            -0.15
    "abase"           -0.15
    "rite"            -0.14
    " الرم"           -0.14
    " kob"            -0.14
    Positive Logits
    " bi"             0.25
    " mon"            0.22
    " trainer"        0.22
    " Bristol"        0.20
    " Junk"           0.20
    " float"          0.19
    " Trainer"        0.19
    " trainers"       0.19
    "trainer"         0.18
    "нес"             0.18
    Activation Density: 0.012%

    No Known Activations