INDEX
Explanations
HTML tags and formatting elements
New Auto-Interp
Negative Logits
fourth
-0.81
Fourth
-0.80
Fourth
-0.74
fifth
-0.74
Fifth
-0.71
seventh
-0.70
fourth
-0.67
第四
-0.67
Seventh
-0.66
sixth
-0.66
POSITIVE LOGITS
secondly
0.99
Secondly
0.98
second
0.97
Second
0.90
Secondly
0.88
zwe
0.86
二
0.86
二
0.85
Kedua
0.85
Second
0.84
Activations Density 0.784%