INDEX
Explanations
phrases related to analysis and evaluation
Follows question words or indicates reasoning
sentence enders
New Auto-Interp
Negative Logits
httphttps
-1.02
Efq
-1.01
raiſ
-1.01
^(@)
-0.97
ſever
-0.96
ſelves
-0.95
MigrationBuilder
-0.95
myſelf
-0.94
iſt
-0.94
?>/
-0.92
POSITIVE LOGITS
.
1.57
,
1.33
;
1.32
!
1.23
?
1.12
:
0.92
."
0.86
。
0.86
.”
0.81
.,
0.80
Activations Density 0.991%