...
Check that services are up and running
Confirm Mozart, Tosca, and ARIA Products pages are all accessible.
Confirm jobs are being processed by reviewing the job status in Mozart and the queue status in RabbitMQ
Confirm there are no stale queues in RabbitMQ or stale jobs in Mozart.
Review Slack alert messages
Resolve the alerts defined in the messages.
Review failed jobs
Investigate cause of failure. Resolve if possible, or contact relevant PGE developer for assistance.
Generate product accountability reports
Generate the AOI reports over the recently-processed AOI’s to assess status of processing campaigns.
Reporting
Notify customers of processing updates
Update any appropriate Jira tickets
Review AWS
Ensure there are no runaway EC2 instances in ASG
Verify that the AWS Billing Daily Cost View is at expected levels