INDEX
Explanations
situations or events involving conflict or disruption
New Auto-Interp
Negative Logits
CI
-0.68
cise
-0.62
selves
-0.60
ancest
-0.57
lug
-0.57
whereas
-0.56
dit
-0.54
Know
-0.53
Pixel
-0.53
Sandra
-0.53
POSITIVE LOGITS
arises
1.16
goes
1.14
arose
1.14
exists
1.12
explodes
1.11
flowed
1.11
disappears
1.10
becomes
1.10
begins
1.09
continues
1.08
Activations Density 2.301%