Database: https://research.chicagobooth.edu/kilts/marketing-databases/dominicks
Sales data (ccount.dta): http://kilts.chicagobooth.edu/dff/store-demos-customer-count/ccount_stata.zip
Store data (demo.dta): http://kilts.chicagobooth.edu/dff/store-demos-customer-count/demo_stata.zip

- Download ccount.dta and demo.dta. Run dtaToCsv to generate retailData.csv.
- Run almanac.py to get output_almanac.csv.
- Compile all output_almanac.csv files into a single CompiledWeather.csv.
- Change the first row of CompiledWeather.csv to key,c1,c2,...,c11
- Run topFive.py to get top5trending.csv.
- Run trendprocessing.py. This adds a 'key' column to top5trending.csv, with values of the form 'zip/date'.
- Create a folder named 'saveHMM', then run method1.py. It performs:
  - (a) ReadDataAndMakeHMM: dumps the HMMs to the 'saveHMM' folder and creates a file named 'hmmRecords.csv' with the columns "hmmno", "key", "MaxByNormalizedQty".
  - (b) loadHMMs: loads the HMMs from the folder and the data from 'hmmRecords.csv'.
  - (c) main routine: saves the predictions to 'predictedData.csv'.
- Run compareTables.py; this gives the mean value.
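
The two manual CSV steps above (rewriting the first row of CompiledWeather.csv and adding the 'zip/date' key) can be sketched in Python as follows. This is only an illustration, not the code in trendprocessing.py: the function names are hypothetical, and the input column names `zip` and `date` are assumptions, since the actual column names in top5trending.csv are not listed here.

```python
import csv
import io


def compiled_weather_header(n_cols=11):
    """Build the header row key,c1,c2,...,c11 (n_cols value columns)."""
    return ["key"] + [f"c{i}" for i in range(1, n_cols + 1)]


def replace_header(src, dst, header):
    """Copy CSV rows from src to dst, replacing the first row with `header`."""
    reader = csv.reader(src)
    writer = csv.writer(dst, lineterminator="\n")
    next(reader, None)          # discard the original first row
    writer.writerow(header)
    writer.writerows(reader)    # copy the remaining rows unchanged


def add_key_column(rows, zip_field="zip", date_field="date"):
    """Yield copies of each row dict with an extra 'key' = 'zip/date'.

    zip_field and date_field are assumed column names (hypothetical).
    """
    for row in rows:
        out = dict(row)
        out["key"] = f"{row[zip_field]}/{row[date_field]}"
        yield out


if __name__ == "__main__":
    # Small in-memory demonstration; real use would open the CSV files.
    src = io.StringIO("old1,old2\n60601/1995-01-05,3.2\n")
    dst = io.StringIO()
    replace_header(src, dst, ["key", "c1"])
    print(dst.getvalue(), end="")

    sample = [{"zip": "60601", "date": "1995-01-05"}]
    print(list(add_key_column(sample)))
```

For the real files, the same functions could be applied to file handles opened with `open(path, newline="")`. The date format used in the key should match whatever format the 'key' column of CompiledWeather.csv uses, so that the two tables can be joined on it.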