INDEX
Explanations
statements or remarks made by people
phrases that indicate statements or remarks made by individuals
New Auto-Interp
Negative Logits
ventures
-0.82
inav
-0.80
Benefits
-0.79
population
-0.76
letal
-0.74
strugg
-0.74
Flavoring
-0.73
iencies
-0.72
axis
-0.71
ulnerability
-0.70
POSITIVE LOGITS
uttered
1.26
echoed
1.22
prophetic
1.17
laced
1.17
sarcastic
1.17
insulting
1.15
scathing
1.14
inflammatory
1.13
provocative
1.12
retweet
1.12
Activations Density 0.270%