INDEX
Explanations
capacity, resources, or ability
New Auto-Interp
Negative Logits
精彩
0.46
Valuable
0.44
有用
0.42
Useful
0.42
useful
0.41
суть
0.40
useful
0.40
精华
0.40
本质
0.39
Useful
0.38
POSITIVE LOGITS
bandwidth
0.98
capacity
0.86
means
0.85
ability
0.83
inclination
0.80
capability
0.79
bandwidth
0.79
Means
0.76
capacidade
0.74
resources
0.72
Activations Density 0.030%