INDEX
Explanations
occurrences of numbers or numerical references in the text
Appearing before certain numbers
numbers and specific counts
New Auto-Interp
Negative Logits
&
-0.70
-0.61
-
-0.57
..
-0.52
.....
-0.51
↵↵↵
-0.50
C
-0.49
…..
-0.48
=
-0.48
______
-0.47
POSITIVE LOGITS
ſta
0.87
iſt
0.87
raiſ
0.80
myſelf
0.79
ſche
0.79
يتيمه
0.79
ſever
0.78
ſy
0.77
ſch
0.77
InputBorder
0.77
Activations Density 0.424%