INDEX
Explanations
words related to controversy or sensitive topics
phrases related to donations or support
New Auto-Interp
Negative Logits
apprentices
-0.74
endeavour
-0.73
volunt
-0.72
lling
-0.71
ilibrium
-0.71
avering
-0.70
uly
-0.67
principally
-0.67
endeav
-0.66
consecut
-0.66
POSITIVE LOGITS
Anyway
1.36
Advertisement
1.19
Seriously
1.10
UPDATE
1.00
Anyway
0.96
Liter
0.95
Yep
0.94
HAHAHAHA
0.93
Bonus
0.92
PHOTOS
0.92
Activations Density 0.678%