INDEX
    Negative Logits
    Token               Logit
    " finance"          -0.07
    "ioneer"            -0.07
    " лю"               -0.06
    ".Style"            -0.06
    "_smooth"           -0.06
    " smokers"          -0.06
    " infrastructure"   -0.06
    " Brewery"          -0.06
    "OLDER"             -0.06
    "Scheduler"         -0.06
    Positive Logits
    Token               Logit
    " path"             0.15
    " Path"             0.12
    " paths"            0.10
    " Paths"            0.09
                        0.09
    " pathway"          0.08
    "-path"             0.08
    "path"              0.08
    " swath"            0.07
    " للس"              0.07
    Activation Density: 0.012%

    No Known Activations