INDEX
Explanations
proper nouns, specifically names of persons
specific identifiers or references to monetary values and distinct items
Negative Logits
SC       -0.91
Sev      -0.86
Sparks   -0.85
Space    -0.84
к        -0.81
Suc      -0.81
Tsuk     -0.80
solid    -0.80
Spur     -0.79
Solid    -0.78
Positive Logits
34       0.90
ami      0.83
onso     0.83
34       0.82
341      0.82
341      0.81
356      0.78
oni      0.78
Alonso   0.77
ania     0.76
Activations Density 0.377%