Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

Backend

...

BOS Sarcat scraper (stand-alone CLI tool)

Front end GUI

...

Instructions: https://github.com/aria-jpl/bos_sarcat_scraper/tree/master/dataset

PGE: bos_ingest

job-spec

  • Code Block
    languagejson
    {
      "required_queues": ["factotum-job_worker-small"],
      "container": "container-aria-jpl_bos_sarcat_scraper:master",
      "command": "/home/ops/verdi/ops/bos_sarcat_scraper/create_acquisitions.sh",
      "imported_worker_files": {
        "/home/ops/verdi/etc/datasets.json": "/home/ops/verdi/etc/datasets.json"
      },
      "disk_usage": "1GB",
      "params": [
        {
          "name": "bos_ingest_time",
          "destination": "context"
        },
        {
          "name": "from_time",
          "destination": "context"
        },
        {
          "name": "end_time",
          "destination": "context"
        }
      ]
    }

hysds_io

  • Code Block
    languagejson
    {
      "submission_type": "individual",
      "component": "tosca",
      "label": "Ingest acquisitions from Bos SARCAT",
      "allowed_accounts": [ "ops" ],
      "params" : [
        {
          "name": "bos_ingest_time",
          "from": "submitter",
          "type": "text",
          "optional": true,
          "placeholder":"start of bos_ingest_timestamp in format yyyy-mm-ddThh:mm:ss.sssZ"
        },
        {
          "name": "from_time",
          "from": "submitter",
          "type": "text",
          "optional": true,
          "placeholder":"start of acquisition time in format yyyy-mm-ddThh:mm:ss.sssZ"
        },
        {
          "name": "end_time",
          "from": "submitter",
          "type": "text",
          "optional": true,
          "placeholder":"end of acquisition time in format yyyy-mm-ddThh:mm:ss.sssZ"
        }
     ]
    }

datasets.json

Code Block
languagejson
  {
    "ipath": "ariamh::data/acquisition-SARCAT",
    "match_pattern": "/(?P<id>acquisition-(?P<spacecraft>.+?)_(?P<year>\\d{4})(?P<month>\\d{2})(?P<day>\\d{2})T(?P<timestamp>.+?)Z_(?P<track>\\d?.+)_(?P<mode>.+?)-bos_sarcat)$",
    "alt_match_pattern": null,
    "extractor": null,
    "level": "l1",
    "type": "bos-acquisition",
    "version": "2.0"
  },
  {
    "ipath": "ariamh::data/acquisition-SARCAT",
    "match_pattern": "/(?P<id>acquisition-(?P<spacecraft>.+?)_(?P<year>\\d{4})(?P<month>\\d{2})(?P<day>\\d{2})T(?P<timestamp>.+?)Z_(?P<track>\\d?.+)_(?P<mode>.+?)-bos_sarcat-planned)$",
    "alt_match_pattern": null,
    "extractor": null,
    "level": "l1",
    "type": "bos-acquisition",
    "version": "2.0"
  },
  {
    "ipath": "ariamh::data/acquisition-SARCAT",
    "match_pattern": "/(?P<id>acquisition-(?P<spacecraft>.+?)_(?P<year>\\d{4})(?P<month>\\d{2})(?P<day>\\d{2})T(?P<timestamp>.+?)Z_(?P<track>\\d?.+)_(?P<mode>.+?)-bos_sarcat-predicted)$",
    "alt_match_pattern": null,
    "extractor": null,
    "level": "l1",
    "type": "bos-acquisition",
    "version": "2.0"
  },

PGE: scrub_outdated_bos_acqs

Cron scripts

Crontab settings: https://github.com/aria-jpl/bos_sarcat_scraper/blob/master/crons/crontab-setting.txt

...

On-demand ops scripts to catch up

https://github.com/aria-jpl/bos_sarcat_scraper/blob/develop/mass_catchup_script.py

Script to temporal segment scrape jobs.

Flask App Services

  • ICS

  • KML

  • CSV

Other

  • Location

  • Log files

  • Debugging process

  • Deployment

  • Watchdogs to check on hourly scraper already in. current checks on bos_ingest_:master

  • ES on b-cluster

    • Alias for sar-availability: acquisition

...