INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    izzer
    -0.08
     Jerry
    -0.07
     Grinder
    -0.06
     Simone
    -0.06
    ucer
    -0.06
     Julie
    -0.06
     Live
    -0.06
     даже
    -0.06
    OLE
    -0.06
    -0.06
    POSITIVE LOGITS
    Path
    0.20
     path
    0.19
     Path
    0.18
    path
    0.17
     paths
    0.15
    _path
    0.14
    -path
    0.13
    PATH
    0.13
     Paths
    0.13
    (path
    0.13
    Act Density 0.041%

    No Known Activations