INDEX
    Explanations

    references to trails and pathways in natural settings

    New Auto-Interp
    Negative Logits
     hills
    -0.17
     vit
    -0.16
     Hills
    -0.15
    lac
    -0.15
     highways
    -0.15
     mountains
    -0.14
     Darkness
    -0.14
     exped
    -0.14
     downtown
    -0.14
     routes
    -0.13
    POSITIVE LOGITS
     signed
    0.19
    signed
    0.19
     faint
    0.17
    ury
    0.17
     scram
    0.17
     unsigned
    0.16
    bench
    0.16
    unsigned
    0.15
     switch
    0.15
     gint
    0.15
    Act Density 0.005%

    No Known Activations