Explanations
numbers indicating a specific ranking or score in a range from 0 to 10
numeric values and references to measurements or statistics
Negative Logits
netflix      -0.84
EStream      -0.76
rette        -0.72
TextColor    -0.72
deen         -0.71
ramid        -0.67
raviolet     -0.67
sole         -0.66
propriet     -0.66
quet         -0.65
Positive Logits
85           0.93
20           0.88
40           0.88
^{           0.88
84           0.86
81           0.86
82           0.86
400          0.85
acity        0.84
92           0.84
Activations Density 0.060%