INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     satellite
    -0.09
    Satellite
    -0.09
     stunt
    -0.09
     Satellite
    -0.09
     Reel
    -0.09
     Sticker
    -0.09
     antenna
    -0.08
     satellites
    -0.08
     sticker
    -0.08
    banner
    -0.08
    POSITIVE LOGITS
     Nietzsche
    0.21
     philosophers
    0.20
     philosophical
    0.20
    0.19
     philosopher
    0.18
     filóso
    0.18
     философ
    0.18
     thinkers
    0.17
     filoz
    0.16
     philos
    0.16
    Act Density 0.065%

    No Known Activations