
Hi everyone, For listing the anomalies, is there a preference for sorting by date+station or sorting by station+date? What we do now is sort by date+station: Anomalies (flag=2): TA00682 2022-08-03 | score: 308.398 (thresh: 261.338) -- 'pr' mm t: 0.000 [17.142, 23.176] TA00686 2022-08-03 | score: 182.706 (thresh: 90.000) -- 'pr' mm t: 0.255 [0.0, 2.227, 17.142, 23.176] TA00148 2022-08-04 | score: 426.326 (thresh: 90.000) -- 'pr' mm t: 0.034 [22.4] TA00364 2022-08-04 | score: 190.747 (thresh: 79.321) -- 'pr' mm t: 1.531 [23.226] TA00398 2022-08-04 | score: 1418.571 (thresh: 90.000) -- 'pr' mm t: 0.068 [38.349] TA00682 2022-08-04 | score: 3087.313 (thresh: 261.338) -- 'pr' mm t: 0.000 [49.777, 85.636] TA00160 2022-08-08 | score: 691.851 (thresh: 266.803) -- 'pr' mm t: 1.769 [0.0, 33.714] TA00686 2022-08-08 | score: 215.214 (thresh: 90.000) -- 'pr' mm t: 0.000 [0.0, 1.769, 3.809, 33.714] TA00056 2022-08-09 | score: 218.996 (thresh: 146.763) -- 'pr' mm t: 0.000 [0.0, 0.017, 20.2] TA00196 2022-08-09 | score: 112.270 (thresh: 89.761) -- 'pr' mm t: 0.000 [0.0, 1.785, 20.2] However, if we sort by the station and then the date, IMO it's easier to see multiple anomalies for the same station: Anomalies (flag=2): TA00056 2022-08-09 | score: 218.996 (thresh: 146.763) -- 'pr' mm t: 0.000 [0.0, 0.017, 20.2] TA00148 2022-08-04 | score: 426.326 (thresh: 90.000) -- 'pr' mm t: 0.034 [22.4] TA00160 2022-08-08 | score: 691.851 (thresh: 266.803) -- 'pr' mm t: 1.769 [0.0, 33.714] TA00196 2022-08-09 | score: 112.270 (thresh: 89.761) -- 'pr' mm t: 0.000 [0.0, 1.785, 20.2] TA00364 2022-08-04 | score: 190.747 (thresh: 79.321) -- 'pr' mm t: 1.531 [23.226] TA00398 2022-08-04 | score: 1418.571 (thresh: 90.000) -- 'pr' mm t: 0.068 [38.349] TA00682 2022-08-03 | score: 308.398 (thresh: 261.338) -- 'pr' mm t: 0.000 [17.142, 23.176] TA00682 2022-08-04 | score: 3087.313 (thresh: 261.338) -- 'pr' mm t: 0.000 [49.777, 85.636] TA00686 2022-08-03 | score: 182.706 (thresh: 90.000) -- 'pr' mm t: 0.255 [0.0, 2.227, 17.142, 23.176] TA00686 2022-08-08 | score: 215.214 (thresh: 90.000) -- 'pr' mm t: 0.000 [0.0, 1.769, 3.809, 33.714] I suppose we could start making a CSV file and attaching it to the daily report, so those so inclined can sort things however they want :-) Any thoughts? Thanks, Michael From: slater@oregonstate.edu <slater@oregonstate.edu> Sent: Tuesday, August 9, 2022 11:18 PM To: rainqc-jobman@ENGR.ORST.EDU Cc: Slater, Michael <slater@oregonstate.edu> Subject: RainQC Job Manager daily report 2022-08-10 Current UTC date: 2022-08-10 -> scoring models for previous day --------------------------------------------------------------------------------- Daily Model Data Completeness Check: data completeness 50% | complete models: 74 of 222 (33.33%) data completeness 60% | complete models: 71 of 222 (31.98%) data completeness 70% | complete models: 66 of 222 (29.73%) data completeness 75% | complete models: 66 of 222 (29.73%) data completeness 80% | complete models: 66 of 222 (29.73%) data completeness 85% | complete models: 66 of 222 (29.73%) data completeness 90% | complete models: 66 of 222 (29.73%) data completeness 95% | complete models: 66 of 222 (29.73%) data completeness 100% | complete models: 66 of 222 (29.73%) ---------------------------- station status | total: 271, delayed: 129, offline 24h: 103, offline week: 35 | battery, min: 0, max: 100, mean: 57.83, std dev: 27.1 | battery, common values: [(100, 160), (0, 43), (74, 4), (65, 3), (53, 3)] | battery <= mean, common countries: [('GH', 19), ('KE', 15), ('ML', 8), ('MW', 7), ('TG', 6)] ---------------------------- 121 LOW DATA (< 0.9) and 108 NO DATA weather stations impacted 156 RainQC models LOW/NO data station impact on models: [('TA00570', 10), ('TA00127', 9), ('TA00568', 9), ('TA00447', 7), ('TA00199', 7), ('TA00078', 6), ('TA00198', 6), ('TA00677', 5), ('TA00126', 5), ('TA00160', 5), ('TA00301', 5), ('TA00370', 5), ('TA00652', 4), ('TA00586', 4), ('TA00140', 4), ('TA00074', 4), ('TA00335', 4), ('TA00391', 4), ('TA00691', 4), ('TA00680', 4), ('TA00270', 4), ('TA00537', 4), ('TA00044', 3), ('TA00041', 3), ('TA00043', 3), ('TA00050', 3), ('TA00045', 3), ('TA00178', 3), ('TA00089', 3), ('TA00091', 3), ('TA00422', 3), ('TA00170', 3), ('TA00251', 3), ('TA00180', 3), ('TA00495', 3), ('TA00319', 3), ('TA00530', 3), ('TA00533', 3), ('TA00536', 3), ('TA00655', 3), ('TA00276', 2), ('TA00308', 2), ('TA00171', 2), ('TA00101', 2), ('TA00102', 2), ('TA00257', 2), ('TA00256', 2), ('TA00118', 2), ('TA00313', 2), ('TA00165', 2), ('TA00148', 2), ('TA00467', 2), ('TA00155', 2), ('TA00168', 2), ('TA00684', 2), ('TA00289', 2), ('TA00203', 2), ('TA00300', 2), ('TA00210', 2), ('TA00262', 2), ('TA00265', 2), ('TA00267', 2), ('TA00393', 2), ('TA00271', 2), ('TA00494', 2), ('TA00385', 2), ('TA00307', 2), ('TA00339', 2), ('TA00361', 2), ('TA00367', 2), ('TA00368', 2), ('TA00398', 2), ('TA00471', 2), ('TA00466', 2), ('TA00547', 2), ('TA00664', 2), ('TA00014', 1), ('TA00031', 1), ('TA00039', 1), ('TA00282', 1), ('TA00092', 1), ('TA00109', 1), ('TA00123', 1), ('TA00128', 1), ('TA00136', 1), ('TA00254', 1), ('TA00179', 1), ('TA00201', 1), ('TA00219', 1), ('TA00033', 1), ('TA00236', 1), ('TA00287', 1), ('TA00286', 1), ('TA00237', 1), ('TA00279', 1), ('TA00392', 1), ('TA00277', 1), ('TA00290', 1), ('TA00336', 1), ('TA00350', 1), ('TA00351', 1), ('TA00322', 1), ('TA00362', 1), ('TA00375', 1), ('TA00333', 1), ('TA00373', 1), ('TA00382', 1), ('TA00383', 1), ('TA00255', 1), ('TA00231', 1), ('TA00482', 1), ('TA00036', 1), ('TA00493', 1), ('TA00543', 1), ('TA00565', 1), ('TA00542', 1), ('TA00602', 1), ('TA00639', 1), ('TA00641', 1), ('TA00638', 1), ('TA00702', 1)] ----------------------------------------------------------- Processed daily jobs for UTC date: 2022-08-09 Start time: 2022-08-10T04:57:14+00:00 End time : 2022-08-10T06:18:26+00:00 Elapsed time HH:MM:SS: 1:21:12 --------------------- Before job processing job table stats: Total 'success' count: 60 Total 'failure' count: 115 Total record count: 1145 Job history table record count: 72483 Scoring job record table record count: 334 --------------------- After job processing job table stats: Total 'success' count: 248 (flag=2 count: 10) (flag=1 abstentions: 2) | 'success' count for 2022-08-09: 66 (flag=2 count: 2) | 'success' count for 2022-08-08: 13 (flag=2 count: 2) | 'success' count for 2022-08-07: 24 (flag=2 count: 0) | 'success' count for 2022-08-06: 32 (flag=2 count: 0) | 'success' count for 2022-08-05: 35 (flag=2 count: 0) | 'success' count for 2022-08-04: 39 (flag=2 count: 4) | 'success' count for 2022-08-03: 39 (flag=2 count: 2) Anomalies (flag=2): TA00682 2022-08-03 | score: 308.398 (thresh: 261.338) -- 'pr' mm t: 0.000 [17.142, 23.176] TA00686 2022-08-03 | score: 182.706 (thresh: 90.000) -- 'pr' mm t: 0.255 [0.0, 2.227, 17.142, 23.176] TA00148 2022-08-04 | score: 426.326 (thresh: 90.000) -- 'pr' mm t: 0.034 [22.4] TA00364 2022-08-04 | score: 190.747 (thresh: 79.321) -- 'pr' mm t: 1.531 [23.226] TA00398 2022-08-04 | score: 1418.571 (thresh: 90.000) -- 'pr' mm t: 0.068 [38.349] TA00682 2022-08-04 | score: 3087.313 (thresh: 261.338) -- 'pr' mm t: 0.000 [49.777, 85.636] TA00160 2022-08-08 | score: 691.851 (thresh: 266.803) -- 'pr' mm t: 1.769 [0.0, 33.714] TA00686 2022-08-08 | score: 215.214 (thresh: 90.000) -- 'pr' mm t: 0.000 [0.0, 1.769, 3.809, 33.714] TA00056 2022-08-09 | score: 218.996 (thresh: 146.763) -- 'pr' mm t: 0.000 [0.0, 0.017, 20.2] TA00196 2022-08-09 | score: 112.270 (thresh: 89.761) -- 'pr' mm t: 0.000 [0.0, 1.785, 20.2] -------- Total 'failure' count: 115 Total record count: 1192 Job history table record count: 72658 Scoring job record table record count: 335 -----------------------------------------------------------