INDEX
Explanations
phrases that indicate alignment or consistency with a specific standard, guideline, or previous action
references to compliance or alignment with established standards or guidelines
New Auto-Interp
Negative Logits
CVE
-0.84
ilt
-0.76
ils
-0.74
soever
-0.73
ilts
-0.68
lvl
-0.66
itars
-0.64
loss
-0.63
Horror
-0.62
livest
-0.61
POSITIVE LOGITS
backer
0.86
arity
0.76
vein
0.73
OGR
0.68
below
0.68
meal
0.67
stad
0.66
dated
0.65
ups
0.64
lineage
0.63
Activations Density 0.021%