INDEX
Explanations
elements related to statistics and quantitative data points
New Auto-Interp
Negative Logits
éĩĮçļĦ
-0.18
endi
-0.18
far
-0.18
_far
-0.18
Away
-0.17
å¹¹
-0.16
Far
-0.15
ä¼ij
-0.15
quet
-0.15
assin
-0.14
POSITIVE LOGITS
above
0.39
below
0.31
above
0.28
Above
0.28
_above
0.27
Above
0.27
ABOVE
0.22
_below
0.22
below
0.22
Below
0.20
Activations Density 0.158%