INDEX
Explanations
comparative phrases highlighting similarities or analogies
New Auto-Interp
Negative Logits
itſelf
-1.04
raiſ
-1.00
ſelf
-0.94
pleaſure
-0.93
Houſe
-0.92
Anſ
-0.90
Reſ
-0.88
Conſ
-0.88
houſe
-0.87
venidos
-0.86
POSITIVE LOGITS
AS
1.21
As
1.19
as
1.12
As
1.05
readAs
1.04
AS
0.85
CreateTagHelper
0.82
ValueStyle
0.81
as
0.75
follows
0.74
Activations Density 0.360%