INDEX
Explanations
initials or acronyms with periods
New Auto-Interp
Negative Logits
theless
-0.71
adobe
-0.65
schild
-0.65
caps
-0.64
favour
-0.62
caps
-0.61
discour
-0.59
constraints
-0.59
centre
-0.59
foreseeable
-0.56
POSITIVE LOGITS
.,
1.76
.?
1.57
.:
1.42
.;
1.35
.,"
1.31
.—
1.17
./
1.17
.-
1.07
.),
1.06
.–
0.96
Activations Density 0.477%