INDEX
Explanations
numerical representations of ratings or scores
New Auto-Interp
Negative Logits
scrap
-0.86
brid
-0.79
vertisement
-0.78
grap
-0.75
thrott
-0.75
paran
-0.73
honoured
-0.72
differe
-0.72
referen
-0.70
retard
-0.70
POSITIVE LOGITS
Conclusion
1.55
CONCLUS
1.30
Advertisements
1.28
Finally
1.27
Lastly
1.24
________________________________________________________________
1.18
Others
1.18
________________________
1.18
--------------------------------------------------------
1.16
----------------------------------------------------------------
1.14
Activations Density 0.049%