Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

...

...

...

...

...

...

...

...

Table of Contents

Table of Contents

Related Github Repos and

...

Tickets

Job Runtime

  • Depends on how many SLCs are being procssed

  • 1+ hours for 8 SLCs

Objective

Creating a stack of SLCs

...

Prerequisite to STAMPS processing

How to

...

Set Up the

...

Inputs

  • Facets to get SLC inputs

    • region (ex. Hawaii) (optional)

    • track_number or trackNumber (depends)

    • datatype: SLC

...

  • Code Block
    raise  Exception('Could not determine a suitable burst offset')
  • There must only be one track in your SLC inputs

correct facet SLC inputs incorrect facet SLC inputs

...

Job Inputs

...

  • Bbox ([*required in Multiprocessing and optional in GNU Parallel (MBR)]

    • min_lat max_lat min_lon max_lon

...

CI Integration (Jenkins)

HySDS-

...

IO and Jobspec-

...

IO

hysds-io.json.topsstack

Code Block
{
  "label": "topsStack Processor",
  "submission_type": "individual",
  "allowed_accounts": [ "ops" ],
  "action-type":  "both",
  "params": [
    {
      "name": "min_lat",
      "from": "submitter",
      "type": "number",
      "optional": false
    },
    {
      "name": "max_lat",
      "from": "submitter",
      "type": "number",
      "optional": false
    },
    {
      "name": "min_lon",
      "from": "submitter",
      "type": "number",
      "optional": false
    },
    {
      "name": "max_lon",
      "from": "submitter",
      "type": "number",
      "optional": false
    },
    {
      "name":"localize_products",
      "from":"dataset_jpath:",
      "type":"text",
      "lambda" : "lambda met: get_partial_products(met['_id'],get_best_url(met['_source']['urls']),[met['_id']+'.zip'])"
    }
  ]
}

...

Code Block
{
  "recommended_queues": ["jjacob_stack"],
  "command": "/home/ops/verdi/ops/topsstack/run_stack.sh",
  "imported_worker_files": {
    "/home/ops/.netrc": "/home/ops/.netrc",
    "/home/ops/.aws": "/home/ops/.aws"
  },
  "soft_time_limit": 10800,
  "time_limit": 18000,
  "disk_usage": "100GB",
  "params": [
    {
      "name": "min_lat",
      "destination": "context"
    },
    {
      "name": "max_lat",
      "destination": "context"
    },
    {
      "name": "min_lon",
      "destination": "context"
    },
    {
      "name": "max_lon",
      "destination": "context"
    },
    {
      "name":"localize_products",
      "destination": "localize"
    } 
  ]
}

Job Outputs

Main file that gets executed is run_stack.sh

...

Output directory structure

...

...

Publish the contents from merged/ into a datasets directory

...

Create dataset file name template

...

Output structure of merged/

Code Block
merged/
    baselines/
        20190506/
        20190518/
        20190530/
            20190530
            20190530.full.vrt
            20190530.vrt
            20190530.xml
    geom_master/
	    *.rdr.aux.xml
        *.rdr.full
        *.rdr.full.aux.xml
        *.rdr.full.vrt
        *.rdr.full.xml
    SLC/
        20190506/
        20190518/
        20190530/
            20190530.slc.full
            20190530.slc.full.aux.xml
            20190530.slc.full.vrt
            20190530.slc.full.xml
            20190530.slc.hdr

Datasets.json Entry

  • file located in .sds/files/datasets.json

Code Block
{
  "ipath": "ariamh::data/STACK",
  "match_pattern": "/(?P<id>coregistered_slcs-(?P<year>\\d{4})(?P<month>\\d{2})(?P<day>\\d{2})(?P<time>\\d{6}).+)$",
  "alt_match_pattern": null,
  "extractor": null,
  "level": "NA",
  "type": "stack",
  "publish": {
    "s3-profile-name": "default",
    "location": "s3://s3-us-west-2.amazonaws.com:80/##BUCKET##/datasets/{type}/{version}/{year}/{month}/{day}/{id}",
    "urls": [
      "http://##WEBDAV_URL##/datasets/{type}/{version}/{year}/{month}/{day}/{id}",
      "s3://##S3_URL##:80/##BUCKET##/datasets/{type}/{version}/{year}/{month}/{day}/{id}"
    ]
  },
  "browse": {
    "location": "davs://##WEBDAV_USER##@##WEBDAV USER##/browse/{type}/{version}/{year}/{month}/{day}/{id}",
    "urls": [
      "https://##WEBDAV##/browse/{type}/{version}/{year}/{month}/{day}/{id}"
    ],
  }
}

ASG (Auto Scaling Group) Configurations

EC2 Instance Type

c5d.9xlarge

c5.24xlarge

Block Devices

/dev/sdc

  • Size (GB): 2048

  • Type: gp2

  • Delete on termination: True

/dev/sda1

  • Size (GB): 20

  • Type: gp2

  • Delete on termination: True

/dev/sdb

  • Size (GB): 50

  • Type: gp2

  • Delete on termination: True

/dev/sdc

  • Size (GB): 2048

  • Type: gp2

  • Delete on termination: True

/dev/sda1

  • Size (GB): 250

  • Type: gp2

  • Delete on termination: True

/dev/sdb

  • Size (GB): 50

  • Type: gp2

  • Delete on termination: True

Spot Price

1.728

2.57

...

Assessment of Steps

https://docs.google.com/spreadsheets/d/1W2KzAWm8VjceB77jfc9kuQUrXtYDiwSPmqxcHuEoQIg/edit#gid=0

Job Runtime

  • Depends on how many SLCs are being processed and number of bursts covered by Bbox.

GNU Parallel


c5d.9xlarge

(36 vCPU, 72 GiB)

c5.24xlarge

(96 vCPU, 192 GiB)

1 year (~30 SLCS, 4 bursts)

7 hrs, 24 mins, 46 secs

4 hrs, 38 mins, 33 secs

2 year (~60 SLCS, 4 bursts)

13 hrs, 37 mins, 39 secs

8 hrs, 16 min, 46 secs

Multiprocessing


c5d.9xlarge

(36 vCPU, 72 GiB)

c5.24xlarge

(96 vCPU, 192 GiB)

1 year (~30 SLCS, 4 bursts)

6 hrs, 43 mins, 27 secs

4 hrs, 19 mins, 30 secs

2 year (~60 SLCS, 4 bursts)

12 hrs, 58 mins, 32 secs

8 hrs, 30 mins, 43 secs

3 year (~90 SLCS, 4 bursts)

18 hrs, 14 mins, 44 secs

10 hrs, 56 mins, 5 secs