INDEX
Explanations
specific symbols or unique characters
New Auto-Interp
Negative Logits
eyer
-0.14
PC
-0.14
Pc
-0.14
Utf
-0.13
computer
-0.13
Foreign
-0.13
cury
-0.13
COMPUTER
-0.13
Cyr
-0.13
/inet
-0.13
POSITIVE LOGITS
Governance
0.29
governance
0.27
Learning
0.22
Contributors
0.21
community
0.21
Contributor
0.20
Community
0.20
OSS
0.20
Steering
0.20
governed
0.19
Activations Density 0.003%