Open Subsurface Data Universe Software / Platform / Data Flow / Data Ingestion / Ingestion DAGs / Issues / #44

Closed
Open
Created Mar 08, 2021 by Kishore Battula (@kibattul, Owner)

Manifest ingestion not picking dataset id generated by file service

In Azure, referring to a file in a manifest requires the following steps:

  1. Generate a signed URL.
  2. Upload the content to the signed URL.
  3. Create the file metadata.
  4. Reference the id returned by the create-file-metadata call in the manifest.
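
The four steps above can be sketched roughly as follows. This is only an illustration of the flow, not the file service client: the three callables (`get_signed_url`, `upload`, `create_file_metadata`) and the `signed_url` / `file_source` keys are hypothetical stand-ins for whatever the real calls return. Only the `data.Datasets` location of the id is taken from the log in this issue.

```python
from typing import Callable, Dict


def build_manifest_dataset_ref(
    content: bytes,
    get_signed_url: Callable[[], Dict[str, str]],   # hypothetical: step 1
    upload: Callable[[str, bytes], None],           # hypothetical: step 2
    create_file_metadata: Callable[[str], str],     # hypothetical: step 3
) -> Dict[str, dict]:
    """Run steps 1-4 and return a manifest fragment that refers to the file."""
    # Step 1: ask the file service for a signed upload location.
    location = get_signed_url()
    # Step 2: upload the file content to that signed URL.
    upload(location["signed_url"], content)
    # Step 3: create the file metadata; the platform returns the record id.
    record_id = create_file_metadata(location["file_source"])
    # Step 4: put the platform-generated id into the manifest's Datasets list.
    return {"data": {"Datasets": [record_id]}}


# Usage with in-memory fakes (no network), just to show the data flow.
uploaded = {}
fragment = build_manifest_dataset_ref(
    b"file bytes",
    get_signed_url=lambda: {"signed_url": "https://example.invalid/upload",
                            "file_source": "/staging/file-1"},
    upload=lambda url, data: uploaded.update({url: data}),
    create_file_metadata=lambda source: (
        "opendes:dataset--File.Generic:253ea910-a13d-4759-a168-73310a8b2b2e"
    ),
)
```

The point of the sketch is that the id placed in the manifest in step 4 is whatever the platform returned in step 3, not a surrogate key.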

These steps are needed because manifest ingestion does not talk to the file service. If it did, step 3 would no longer be a separate step, because creating the file metadata would be part of manifest ingestion.

Because of this limitation, the manifest contains the actual id from the platform when it refers to a file. The manifest ingestion code on master throws an exception when validating dataset ids. See the logs below:

Failed validating 'pattern' in schema['properties']['data']['allOf'][1]['properties']['Datasets']['items']:
    {'description': 'The SRN which identifies this OSDU File resource.',
     'pattern': '^(surrogate-key:.+|[\\w\\-\\.]+:dataset\\-\\-[\\w\\-\\.]+:[\\w\\-\\.\\:\\%]+:[0-9]*)$',
     'type': 'string',
     'x-osdu-relationship': [{'GroupType': 'dataset'}]}

On instance['data']['Datasets'][0]:
    'opendes:dataset--File.Generic:253ea910-a13d-4759-a168-73310a8b2b2e'
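
The mismatch in the log can be reproduced with the standard `re` module. The sketch below copies the pattern and the id from the log; the helper name `matches_dataset_pattern` is made up for illustration. One reading of the result is that the pattern expects a final `:` followed by an optional numeric version, which the id returned by the file service does not carry. The id with a trailing colon is only a probe of the pattern, not a value from the platform.

```python
import re

# Pattern copied from the schema in the log (un-escaped from its Python repr).
DATASET_ID_PATTERN = (
    r"^(surrogate-key:.+"
    r"|[\w\-\.]+:dataset\-\-[\w\-\.]+:[\w\-\.\:\%]+:[0-9]*)$"
)


def matches_dataset_pattern(dataset_id: str) -> bool:
    """Return True if dataset_id satisfies the schema's 'pattern' keyword."""
    return re.search(DATASET_ID_PATTERN, dataset_id) is not None


# Id from the log: no trailing ':<version>' segment.
platform_id = "opendes:dataset--File.Generic:253ea910-a13d-4759-a168-73310a8b2b2e"

print(matches_dataset_pattern(platform_id))             # False
print(matches_dataset_pattern(platform_id + ":"))       # True (empty version segment)
print(matches_dataset_pattern("surrogate-key:file-1"))  # True
```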