INDEX
Explanations
phrases related to observations or investigations
Negative Logits
Mariners      -0.70
Priest        -0.60
Engineers     -0.58
priesthood    -0.57
NAACP         -0.56
Adin          -0.56
Zeit          -0.55
Fey           -0.55
DOT           -0.55
Ment          -0.54
Positive Logits
beforehand    0.91
throughout    0.83
during        0.81
dearly        0.79
.:            0.78
before        0.75
regarding     0.72
.</           0.71
whilst        0.70
.?            0.70
Activations Density: 1.572%