INDEX
Explanations
numerical references related to positive roots
Numbers following a dollar sign
specific numbers and symbols
New Auto-Interp
Negative Logits
desmotivaciones
-0.59
Gór
-0.57
Masyarakat
-0.52
vulnerables
-0.52
completo
-0.51
脚注の使い方
-0.51
Exteriores
-0.50
jugu
-0.50
Betyg
-0.49
يًا
-0.49
POSITIVE LOGITS
<bos>
0.79
|.
0.71
|$.
0.68
+'.
0.68
.$.
0.67
\.
0.66
Autoritní
0.65
+".
0.64
.'.
0.63
}$.
0.62
Activations Density 1.042%