INDEX
Explanations
phrases related to providing information or recounting events
phrases related to conflicting statements or contradictory information
New Auto-Interp
Negative Logits
pioneered
-0.86
abolished
-0.77
Designed
-0.74
pioneers
-0.73
thriving
-0.72
redesigned
-0.71
Pione
-0.69
rebuilt
-0.69
requires
-0.69
Beaut
-0.68
POSITIVE LOGITS
conversation
1.41
reply
1.30
misunderstanding
1.26
remark
1.22
comment
1.21
accusation
1.21
verbal
1.20
questioning
1.19
conversations
1.19
explanation
1.17
Activations Density 0.794%