INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    eries
    -0.69
    vernment
    -0.68
    ggles
    -0.64
    ework
    -0.63
    alty
    -0.62
    affle
    -0.59
    icide
    -0.59
    ovi
    -0.59
    ãĤ¡
    -0.58
    ocious
    -0.57
    POSITIVE LOGITS
     atop
    1.16
     somewhere
    0.98
     inside
    0.96
     indoors
    0.93
     near
    0.90
     beside
    0.89
     elsewhere
    0.89
     therein
    0.87
     outdoors
    0.86
     Somewhere
    0.86
    Act Density 1.430%

    No Known Activations