INDEX
    Explanations

    references to natural environments, specifically forests

    New Auto-Interp
    Negative Logits
    odder
    -0.73
    ampion
    -0.67
    ork
    -0.66
    bour
    -0.65
    rament
    -0.65
    ool
    -0.63
     Ack
    -0.63
    obb
    -0.62
    aba
    -0.62
    afa
    -0.61
    POSITIVE LOGITS
     to
    0.97
    to
    0.89
    onte
    0.87
     thereto
    0.82
     TO
    0.76
    )=(
    0.71
     Nanto
    0.70
    TO
    0.69
     obliged
    0.68
    İ
    0.67
    Act Density 0.251%

    No Known Activations