INDEX
Explanations
phrases related to significant changes or shifts
phrases about changes in conditions or parameters
New Auto-Interp
Negative Logits
YES
-0.73
soever
-0.69
¤
-0.67
Ĭ
-0.67
hanged
-0.66
DEV
-0.66
netflix
-0.66
rs
-0.65
ague
-0.65
linger
-0.65
POSITIVE LOGITS
efficiency
1.04
regards
1.00
effic
0.99
relation
0.97
favor
0.93
accordance
0.92
lieu
0.91
animate
0.89
conjunction
0.87
order
0.85
Activations Density 0.099%