INDEX
Explanations
words related to economics and numerical data related to people and money.
Punctuation marks
New Auto-Interp
Negative Logits
Theſe
-1.02
Anſ
-0.91
Diſ
-0.90
Reſ
-0.86
Houſe
-0.83
faſt
-0.81
purpoſe
-0.79
themſelves
-0.78
becauſe
-0.77
houſe
-0.77
POSITIVE LOGITS
,
0.61
-
0.54
–
0.52
.
0.48
—
0.45
;
0.44
!
0.43
ParallelGroup
0.43
−
0.41
represent
0.40
Activations Density 6.510%