INDEX
Explanations
names or titles in quotation marks
quotation marks or speech indicators
New Auto-Interp
Negative Logits
reconcil
-0.76
appointments
-0.73
repay
-0.70
populated
-0.70
peripheral
-0.70
menstrual
-0.69
replication
-0.68
aggregate
-0.67
childbirth
-0.66
furn
-0.66
POSITIVE LOGITS
Fat
1.17
Skip
1.11
Big
1.06
Rust
1.05
Golden
1.04
Spirit
1.04
Fuck
1.04
Bob
1.02
Bull
1.02
Hope
1.00
Activations Density 0.077%