INDEX
Explanations
mentions of specific items or actions in various segments of information
sentence-ending punctuation or phrases indicating a strong conclusion
Negative Logits
tremend        -0.85
challeng       -0.84
advoc          -0.80
warr           -0.80
intermediate   -0.79
concess        -0.79
carbohyd       -0.78
imb            -0.77
compr          -0.77
encount        -0.75
Positive Logits
According      1.36
Apparently     1.26
Yesterday      1.22
Nope           1.22
Luckily        1.20
Instead        1.18
Thankfully     1.18
Turns          1.18
Consider       1.16
Earlier        1.14
Activations Density 0.473%
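
The density figure is read here as the share of sampled tokens on which the feature is active (activation above zero). The page does not define the term, so that reading is an assumption. A minimal sketch of the calculation, using a hypothetical helper `activation_density` and made-up activation values rather than data from this feature:

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: "density" means the share of tokens with a nonzero
    activation; the dashboard itself does not state its definition.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 1000 tokens, 5 of which activate the feature.
acts = [0.0] * 995 + [0.7, 1.2, 0.3, 2.1, 0.9]
density = activation_density(acts)
print(f"{density:.3%}")  # prints 0.500%
```

Under this reading, a density of 0.473% would mean the feature fires on roughly 4 to 5 tokens out of every 1000 sampled.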