INDEX
Explanations
mentions of technological devices or tools
phrases that denote the state or existence of something
New Auto-Interp
Negative Logits
Unemployment
-0.67
inav
-0.66
eday
-0.65
robat
-0.65
PsyNetMessage
-0.65
Mobility
-0.64
Dag
-0.64
Job
-0.64
Lilly
-0.62
Explosion
-0.62
POSITIVE LOGITS
able
1.17
held
0.86
eaten
0.84
falls
0.81
considered
0.81
discharged
0.81
disposed
0.81
hemoth
0.80
acons
0.79
seen
0.78
Activations Density 0.054%