INDEX
Explanations
percentages and statistical data
New Auto-Interp
Negative Logits
;"></
-0.80
')
-0.73
\"></
-0.69
len
-0.69
fla
-0.68
"),
-0.68
alie
-0.67
Fla
-0.67
LAM
-0.67
Lang
-0.66
POSITIVE LOGITS
%
1.84
\%$
1.78
\%
1.53
percent
1.49
%?
1.45
\%
1.44
percent
1.43
%
1.43
%+
1.39
%,
1.38
Activations Density 0.163%