INDEX
    Explanations

    json structures with proxies

    New Auto-Interp
    Negative Logits
     भूमिका
    0.42
    smoothing
    0.39
    емо
    0.38
     masculino
    0.37
    بە
    0.37
    Looking
    0.37
     embodying
    0.37
    Smoothing
    0.37
    मोबाइल
    0.36
    Gene
    0.35
    POSITIVE LOGITS
    prox
    0.50
     proxies
    0.44
     CAC
    0.44
     metac
    0.44
     clamp
    0.43
    oprop
    0.42
     clamps
    0.41
     dict
    0.41
     proxy
    0.40
     foc
    0.39
    Act Density 0.006%

    No Known Activations