Explanations
references to specific written or spoken material
relative clauses or phrases frequently starting with "that."
Negative Logits
believing       -0.61
suspect         -0.61
idious          -0.58
say             -0.57
uty             -0.57
persuasion      -0.56
aversion        -0.56
abre            -0.56
OME             -0.55
apprehension    -0.54
Positive Logits
accompanies     1.40
incorporates    1.14
utilizes        1.13
exists          1.12
contains        1.10
collects        1.10
spans           1.09
covers          1.08
belongs         1.07
connects        1.06
Activations Density: 0.193%