INDEX
Explanations
conjunctions and short phrases indicating a sequence of actions or events
phrases related to emotional reactions and interpersonal dynamics
New Auto-Interp
Negative Logits
Compatibility
-0.60
Create
-0.57
%%
-0.56
Join
-0.56
CIS
-0.55
Locations
-0.54
tains
-0.54
nutshell
-0.53
%%
-0.53
Spread
-0.53
POSITIVE LOGITS
confessed
1.14
remarked
1.13
apologised
1.13
joked
1.11
muttered
1.08
replied
1.06
recalled
1.05
apologized
1.04
insisted
1.04
thanked
1.03
Activations Density 0.729%