INDEX
Explanations
elements and attributes in structured data or code
Negative Logits
Token      Logit
Jordan     -0.20
047        -0.18
Ĩ          -0.18
Dick       -0.18
nuclear    -0.18
Nuclear    -0.17
Gro        -0.16
147        -0.16
/cop       -0.16
Jordan     -0.16
Positive Logits
Token      Logit
45         0.32
485        0.30
85         0.30
cotton     0.29
845        0.29
Cotton     0.27
045        0.24
185        0.23
285        0.23
245        0.23
Activations Density 0.028%