INDEX
Explanations
phrases related to cause and effect
specific recurring phrases or words that suggest rules and consequences in a narrative
New Auto-Interp
Negative Logits
Historically
-0.76
abee
-0.75
ibaba
-0.75
estone
-0.74
inspired
-0.72
allel
-0.71
Columb
-0.70
derived
-0.69
cum
-0.68
umen
-0.68
POSITIVE LOGITS
consequences
1.22
slightest
1.22
rest
1.15
odds
1.11
guy
1.10
repercussions
1.05
fuck
1.05
situation
1.04
truth
1.04
outcome
1.02
Activations Density 0.385%