INDEX
Explanations
phrases introducing new information or situations
the word "that" in various contexts
New Auto-Interp
Negative Logits
Expect
-0.68
*/
-0.64
ggles
-0.62
Join
-0.61
fts
-0.60
iscover
-0.60
etermin
-0.58
Legislation
-0.57
Disorders
-0.56
Sit
-0.55
POSITIVE LOGITS
resembled
1.56
consisted
1.46
amounted
1.45
lasted
1.45
resulted
1.41
culminated
1.38
differed
1.34
seemed
1.34
lacked
1.31
hadn
1.26
Activations Density 0.248%