Re: [Rainqc-jobman] database problem processing jobs today

Similar to about 10 days ago, there was a database issue processing the jobs today. Over the past year and a half, give or take, we've had only four unusual exceptions processing the jobs, including the one today: ---------- JOBMAN.LOG.20221212 2022-12-12 22:03:24.640 | ERROR : Exception in process_jobs_for_day: InterfaceError: connection already closed ---------- JOBMAN.LOG.20230903 2023-09-02 22:46:39.415 | ERROR : Exception in process_jobs_for_day: could not translate host name "31b393d2-9a3f-4c15-b86f-33579993dbb5.bc28ac43cf10402584b5f01db462d330.databases.appdomain.cloud" to address: Unknown host ---------- JOBMAN.LOG.20231213 2023-12-12 22:46:41.263 | ERROR : Exception in process_jobs_for_day: current transaction is aborted, commands ignored until end of transaction block ---------- JOBMAN.LOG 2023-12-22 23:34:29.115 | ERROR : Exception in process_jobs_for_day: current transaction is aborted, commands ignored until end of transaction block These last two appear as if some other process is accessing the database when we are processing the jobs. The first one above on 2022-12-12 might also be a database related error. The second one, from 2023-09-02, seems to be related to DNS/network routing, an area where I still see occasional problems, mostly related to Kubernetes. Michael
participants (1)
-
Slater, Michael