INDEX
Explanations
dollar amounts or financial transactions
punctuation marks, specifically periods at the end of sentences
New Auto-Interp
Negative Logits
selfie
-0.86
microbiome
-0.86
plet
-0.81
compet
-0.79
emoji
-0.79
yip
-0.78
genders
-0.77
dips
-0.77
vape
-0.75
upgr
-0.74
POSITIVE LOGITS
Eventually
1.77
Shortly
1.65
Soon
1.57
Later
1.56
Afterwards
1.54
Initially
1.47
Ultimately
1.47
Accordingly
1.47
Consequently
1.46
Fortunately
1.43
Activations Density 0.346%