    Negative Logits
        "eners"            -1.01
        "enegger"          -0.85
        "colo"             -0.84
        "ebook"            -0.74
        "advertisement"    -0.73
        " Gutenberg"       -0.73
        "enance"           -0.69
        "etting"           -0.69
        "sticks"           -0.69
        "ijn"              -0.65
    Positive Logits
        " Airlines"        1.14
        " Anchorage"       1.12
        " Springs"         1.10
        " Highlands"       0.92
        " Native"          0.88
        " Pradesh"         0.87
        "ansas"            0.85
        " Alaska"          0.83
        "aska"             0.79
        "ħĭ"               0.78
    Activation Density: 0.004%

    No Known Activations