Explanations
evidence and confirmation related to scientific studies and hypotheses
Negative Logits
-0.71
RectangleBorder  -0.67
发表于  -0.60
XmlIgnore  -0.59
hâm  -0.57
Diweddarwch  -0.57
(!__  -0.57
WriteBarrier  -0.57
AccessorTable  -0.56
مشين  -0.56
Positive Logits
Confirmed  0.61
confirmed  0.60
testemun  0.57
eyewitness  0.57
corroborated  0.56
anecdotal  0.55
confirmed  0.54
inferred  0.53
Confirmed  0.51
φαλ  0.51
Activations Density 0.960%