FILES ON LIBRA ============== Most files for running the system are under /ReportingThird/stats/UDW/Install. bin/runUDWPrep.sh is script for running system. This uses: script/Dir_UDW.config - directory configuration script/utilities.sh - defines 'echots' and 'sendmail' functions Creates config file 'config/udw-conf.xml' from 'config/udw-conf.xml.master' by setting start & end dates to current day. Runs program udwreport. Output goes to /ReportingThird/stats/UDW/output: UDW_WP_User - ConsumerFileName UDW_WP_Session - SessionFileName UDW_WP_Usage_Fact - UsageFactFileName UDW_WP_Access_Fact - AccessFactFileName UDW_WP_Customer_Contract - CustomerContractFileName Files dumped from Wispers database: in /ReportingFourth/stats/Reporting/Data: CollDoc.bcp - DocumentFileName ContentType.dbd - ContentTypeFileName FreeArticles.dbd - FreeArticleFileName OATimeStamp.dat - OpenAccessFileName Consumer.dbd - ConsumerFileName SuperUser.dbd - SuperUserFileName License.bcp - LicenseFileName LicenseProducts.bcp - LicenseProductFileName ProductCollections.bcp - ProductCollectionFileName Hierarchy.bcp - HierarchyFileName in /ReportingThird/stats/UDW/Persistent: SessionCount.dat - SessionCountFileName SequenceMarker.dat - SequenceMarkerFileName ------------------------------------------------------------------------------ Data for DW loading (from Zhiming's diagram) ------------------- Usage Fact (800,000 records/daily) Session key Product ID Sale Model Contract code Date Type User Customer account Year Session sequence Usage flag Type flag License Access Fact (8 million total) Product ID Sale Model Contract code Date User Customer account Year Type flag License Product (1.5 million total) Product ID Volume Issue Part OA date OID DOI Title Author Publication date Type code Print ID Customer Contract (100,000 records daily) Customer account ID License ID Contract year Contract type Registration date First use date Customer Dimension (150K total) Account ID Name Type Country Email License Dimension (1.2M total) License ID Description Type code Type description Start date End date Session Dimension (150,000 records daily) Session ID Entry page Exit page Referrer page Referrer site CID cookie PCLT cookie Authentication method Start time User Dimension (70,000 records daily) Account code IP Country code Email