INDEX
Explanations
references to truth and reality
New Auto-Interp
Negative Logits
gyz
-0.64
CodeAttribute
-0.61
AssemblyCompany
-0.59
genuine
-0.58
addPreferredGap
-0.57
genuine
-0.56
yarnpkg
-0.56
nodoc
-0.56
AddTagHelper
-0.54
/***/
-0.52
POSITIVE LOGITS
Truths
0.74
truths
0.71
fulness
0.66
TRUTH
0.62
Truth
0.62
serum
0.59
realities
0.59
Reality
0.57
Reality
0.57
truth
0.56
Activations Density 0.076%