INDEX
Explanations
numerical values and their associated properties
New Auto-Interp
Negative Logits
<bos>
-0.86
Datuak
-0.71
InitVars
-0.65
XmlAccessType
-0.64
contentLoaded
-0.60
Palmarès
-0.59
rungsseite
-0.58
ⓧ
-0.55
retweeted
-0.55
bellow
-0.55
POSITIVE LOGITS
,</
0.87
,$_
0.83
\",\
0.76
,+
0.73
,",
0.73
–,
0.71
,\\
0.70
,&
0.70
′,
0.70
,}
0.68
Activations Density 0.132%