INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    24
    -0.06
     repro
    -0.06
     Rud
    -0.06
    -0.06
    フォ
    -0.06
     Igor
    -0.06
    677
    -0.06
    ERR
    -0.06
    (ch
    -0.06
    omic
    -0.06
    POSITIVE LOGITS
     capacity
    0.21
     Capacity
    0.16
    capacity
    0.11
     capacities
    0.11
    Capacity
    0.09
    _capacity
    0.07
    _CAPACITY
    0.07
    ACITY
    0.07
    ancy
    0.07
     capacidad
    0.07
    Act Density 0.007%

    No Known Activations