INDEX
Explanations
references and instances related to specificity in content
New Auto-Interp
Negative Logits
_specific
-0.24
specifically
-0.23
Specifically
-0.22
especÃŃf
-0.22
specific
-0.22
pecific
-0.21
Specific
-0.20
specific
-0.19
اÙĨÙĩ
-0.18
Specific
-0.17
POSITIVE LOGITS
ities
0.43
ity
0.40
ially
0.39
ally
0.39
ily
0.33
ized
0.30
-purpose
0.29
ALLY
0.27
ied
0.26
kind
0.25
Activations Density 0.088%