INDEX
Explanations
references to varying degrees of specificity and the presence of multiple elements or aspects in a context
New Auto-Interp
Negative Logits
c
-0.61
:
-0.53
x
-0.52
pr
-0.51
-0.50
is
-0.50
,
-0.49
And
-0.49
"
-0.48
c
-0.47
POSITIVE LOGITS
المعيارى
1.28
myſelf
1.09
Theſe
1.09
resourceCulture
1.04
CloseOperation
1.00
esternos
0.96
becauſe
0.96
ſtate
0.95
Reſ
0.94
Anſ
0.93
Activations Density 0.128%