INDEX
Explanations
references to structured objectives or aims in research contexts, especially involving tables and data representations
"2" after specific tokens
numbers followed by 2
New Auto-Interp
Negative Logits
fourth
-0.99
Fourth
-0.88
Fourth
-0.87
fifth
-0.86
sixth
-0.85
seventh
-0.84
ninth
-0.80
第四
-0.79
fourth
-0.78
eighth
-0.78
POSITIVE LOGITS
secondly
1.17
Secondly
1.09
second
1.02
Kedua
0.99
Secondly
0.97
二
0.95
Second
0.94
deux
0.94
kedua
0.93
zwe
0.93
Activations Density 0.510%