INDEX
Explanations
phrases indicating a change in circumstances or outcomes
phrases indicating change or deterioration in situations
New Auto-Interp
Negative Logits
amiya
-0.77
Reincarn
-0.73
Preservation
-0.70
ende
-0.67
Recomm
-0.64
egu
-0.63
esting
-0.63
Refugees
-0.62
Maiden
-0.59
Joined
-0.58
POSITIVE LOGITS
hairy
1.12
ugly
1.03
complicated
1.00
tricky
0.99
messy
0.98
worse
0.95
nasty
0.94
interesting
0.93
crazy
0.91
murky
0.89
Activations Density 0.116%