INDEX
Explanations
specific numerical values followed by cities and monetary figures
multiple instances of numerical values or statistical data
New Auto-Interp
Negative Logits
arsen
-0.69
beaut
-0.68
respons
-0.60
proposition
-0.59
dedication
-0.58
dictator
-0.57
aside
-0.56
calendars
-0.56
liberation
-0.56
adventurer
-0.55
POSITIVE LOGITS
5
1.52
0
1.46
8
1.43
3
1.43
6
1.43
9
1.41
7
1.40
2
1.37
4
1.35
1
1.33
Activations Density 0.052%