INDEX
    Explanations

    capacity, resources, or ability

    New Auto-Interp
    Negative Logits
    精彩
    0.46
     Valuable
    0.44
    有用
    0.42
     Useful
    0.42
     useful
    0.41
     суть
    0.40
    useful
    0.40
    精华
    0.40
    本质
    0.39
    Useful
    0.38
    POSITIVE LOGITS
     bandwidth
    0.98
     capacity
    0.86
     means
    0.85
     ability
    0.83
     inclination
    0.80
     capability
    0.79
    bandwidth
    0.79
     Means
    0.76
     capacidade
    0.74
     resources
    0.72
    Act Density 0.030%

    No Known Activations