INDEX
Explanations
phrases related to trends or analyses thereof
references to trends or changes over time
New Auto-Interp
Negative Logits
gur
-0.94
tein
-0.75
oath
-0.73
vette
-0.69
sacrific
-0.63
emies
-0.62
Lak
-0.62
dos
-0.61
straw
-0.61
ongh
-0.61
POSITIVE LOGITS
setting
1.27
trend
1.04
Trend
0.88
toward
0.87
trends
0.87
chart
0.85
set
0.82
towards
0.82
Trend
0.82
line
0.81
Activations Density 0.033%