Merged
Created Aug 09, 2021 by Aleksandr Spivakov (EPAM) (@aleksandr_spivakov, Developer). 2 of 6 tasks completed.

GONRG-2913: Added support for whitelist reference patterns

Type of change

  • Bug Fix
  • Feature

Does this introduce a change in the core logic?

  • [Yes]

Does this introduce a change in the cloud provider implementation, if so which cloud?

  • AWS
  • Azure
  • GCP
  • IBM

Updates description?

Closes osdu/platform/data-flow/ingestion/external-data-sources/external-data-framework#180 (closed)

Introduces the ability to whitelist references using custom regexp patterns, so that those references are excluded from referential integrity validation.

For example, suppose we found the following list of references, all of which are validated in the default scenario:

[
    "osdu:reference-data--ResourceSecurityClassification:RESTRICTED",
    "osdu:master-data--Wellbore:1013",
    "osdu:reference-data--UnitOfMeasure:M",
    "osdu:reference-data--UnitOfMeasure:GAPI",
    "osdu:reference-data--UnitOfMeasure:US/F",
    "osdu:reference-data--UnitOfMeasure:G/C3",
    "osdu:reference-data--UnitOfMeasure:V/V"
]

Later we may realize that some of these references should not be validated. With the new whitelist feature we can write custom regexp patterns that exclude them. Let's say we are not interested in validating these:

[
    "osdu:reference-data--UnitOfMeasure:GAPI",
    "osdu:reference-data--UnitOfMeasure:V/V"
]

First, we need to define our custom patterns somewhere. Suppose they look like this (the named groups are required, because the inner logic relies on them in the current implementation):

\"(?P<key>CurveUnit)\":\s?\"(?P<value>[\w\d:-]*:GAPI:)\"
\"(?P<key>CurveUnit)\":\s?\"(?P<value>[\w\d:-]*:V\/V:)\"
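
To illustrate what the named groups capture, here is a minimal sketch using Python's standard `re` module. The manifest fragment and the `find_matches` helper are hypothetical and are not part of the SDK. The fragment assumes that reference values in a manifest end with a trailing colon, as the patterns above imply.

```python
import re

# The two patterns from the description, written as Python raw strings.
PATTERNS = [
    r'\"(?P<key>CurveUnit)\":\s?\"(?P<value>[\w\d:-]*:GAPI:)\"',
    r'\"(?P<key>CurveUnit)\":\s?\"(?P<value>[\w\d:-]*:V\/V:)\"',
]

# Hypothetical manifest fragment (assumption: values end with a colon).
manifest_fragment = (
    '{"CurveUnit": "osdu:reference-data--UnitOfMeasure:GAPI:", "Mnemonic": "GR"}'
)


def find_matches(patterns, text):
    """Return (key, value) pairs captured by the named groups of each pattern."""
    found = []
    for pattern in patterns:
        for match in re.finditer(pattern, text):
            found.append((match.group("key"), match.group("value")))
    return found


for key, value in find_matches(PATTERNS, manifest_fragment):
    print(key, value)
```

Without the `key` and `value` groups, `match.group("key")` would raise an error, which is one plausible reason why code that reads the groups by name requires them.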

The patterns can be read either from a file or, for example, from an Airflow variable (see osdu/platform/data-flow/ingestion/ingestion-dags!64 (merged)).
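
As a rough sketch of the parsing step, the snippet below reads patterns from a string and checks that each one compiles and defines both named groups. The one-pattern-per-line format, the `parse_patterns` helper, and the `REQUIRED_GROUPS` constant are assumptions made for illustration. The description does not state the exact format the SDK expects.

```python
import re

# Assumption: each pattern must define these two named groups.
REQUIRED_GROUPS = {"key", "value"}

# Assumption: the raw text holds one pattern per line.
RAW_PATTERNS = r'''
\"(?P<key>CurveUnit)\":\s?\"(?P<value>[\w\d:-]*:GAPI:)\"
\"(?P<key>CurveUnit)\":\s?\"(?P<value>[\w\d:-]*:V\/V:)\"
'''


def parse_patterns(raw):
    """Compile every non-empty line and reject patterns without the groups."""
    compiled = []
    for line in raw.splitlines():
        line = line.strip()
        if not line:
            continue
        try:
            pattern = re.compile(line)
        except re.error as exc:
            raise ValueError(f"Invalid pattern {line!r}: {exc}") from exc
        missing = REQUIRED_GROUPS - set(pattern.groupindex)
        if missing:
            raise ValueError(f"Pattern {line!r} lacks groups: {sorted(missing)}")
        compiled.append(pattern)
    return compiled


patterns = parse_patterns(RAW_PATTERNS)
print(len(patterns), "patterns loaded")
```

The same string could come from a file read or from a variable store. Validating early keeps a malformed pattern from failing later, in the middle of manifest processing.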

After that, we can pass the custom patterns as a string to ManifestIntegrity on initialization. If they are valid, we get a new list of references for validation:

[
    "osdu:reference-data--ResourceSecurityClassification:RESTRICTED",
    "osdu:master-data--Wellbore:1013",
    "osdu:reference-data--UnitOfMeasure:M",
    "osdu:reference-data--UnitOfMeasure:US/F",
    "osdu:reference-data--UnitOfMeasure:G/C3"
]

As we can see, the two references were whitelisted, so they will be skipped at the referential integrity validation stage.
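
The filtering described above can be sketched end to end as follows. This is not the ManifestIntegrity implementation. The helper names, the sample manifest, and the stripping of the trailing colon before comparison are all assumptions chosen to reproduce the lists shown in this description.

```python
import json
import re

WHITELIST_PATTERNS = [
    r'\"(?P<key>CurveUnit)\":\s?\"(?P<value>[\w\d:-]*:GAPI:)\"',
    r'\"(?P<key>CurveUnit)\":\s?\"(?P<value>[\w\d:-]*:V\/V:)\"',
]

# References found in the default scenario (taken from the description).
REFERENCES = [
    "osdu:reference-data--ResourceSecurityClassification:RESTRICTED",
    "osdu:master-data--Wellbore:1013",
    "osdu:reference-data--UnitOfMeasure:M",
    "osdu:reference-data--UnitOfMeasure:GAPI",
    "osdu:reference-data--UnitOfMeasure:US/F",
    "osdu:reference-data--UnitOfMeasure:G/C3",
    "osdu:reference-data--UnitOfMeasure:V/V",
]

# Hypothetical manifest text in which some curves use the whitelisted units.
MANIFEST_TEXT = json.dumps({"Curves": [
    {"CurveUnit": "osdu:reference-data--UnitOfMeasure:GAPI:"},
    {"CurveUnit": "osdu:reference-data--UnitOfMeasure:V/V:"},
    {"CurveUnit": "osdu:reference-data--UnitOfMeasure:M:"},
]})


def whitelisted_values(patterns, manifest_text):
    """Collect the values captured by the whitelist patterns."""
    values = set()
    for pattern in patterns:
        for match in re.finditer(pattern, manifest_text):
            # Assumption: drop the trailing colon to match the reference form.
            values.add(match.group("value").rstrip(":"))
    return values


def filter_references(references, whitelist):
    """Keep only references that are not whitelisted, preserving order."""
    return [ref for ref in references if ref not in whitelist]


to_validate = filter_references(
    REFERENCES, whitelisted_values(WHITELIST_PATTERNS, MANIFEST_TEXT)
)
print(json.dumps(to_validate, indent=4))
```

Under these assumptions, the `GAPI` and `V/V` references are dropped, and the remaining five references stay in their original order.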

Edited Aug 09, 2021 by Siarhei Khaletski (EPAM)
Source branch: GONRG-2913_whitelist_reference_patterns_feature