INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    .Infrastructure
    -0.08
     charging
    -0.08
    -face
    -0.07
     amid
    -0.07
    }}{{
    -0.07
     attended
    -0.07
     fought
    -0.07
     sustained
    -0.07
     attitudes
    -0.07
     Produced
    -0.07
    POSITIVE LOGITS
    zoom
    0.10
     viewer
    0.10
     zoom
    0.09
    _zoom
    0.09
    Zoom
    0.09
     visualize
    0.09
     Zoom
    0.08
     Viewer
    0.08
     visualization
    0.08
    .zoom
    0.08
    Act Density 0.004%

    No Known Activations