@@ -42,6 +42,7 @@ Wellbore Domain Data Management Services (Wellbore-DDMS) Open Subsurface Data Un
-[pandas](https://pandas.pydata.org/) and [numpy](https://numpy.org/) for data manipulation
-[pyarrow](https://pypi.org/project/pyarrow/) for load and save data into parquet format
-[opencensus](https://opencensus.io/guides/grpc/python/) for tracing and logging on cloud provider
-[dask](https://docs.dask.org/en/latest/) to manage huge amount of bulk data
### Library Dependencies
...
...
@@ -60,6 +61,16 @@ Wellbore Domain Data Management Services (Wellbore-DDMS) Open Subsurface Data Un
## Project Startup
### Dask Configuration - Locally
By default, It will use all memory available and use CPU resources through workers. The number of workers is determined by the quantity of core the current local machine has.
### Dask Configuration - In a cluster
In a container context, such as Kubernetes we recommend to set container memory limit at 3Gi of RAM and 4-8 CPUs.
At the minimum 1.2Gi and 1 cpu but performance will be reduced, but enough to handle WellLogs of 10 curves with 1M values each.
Note: container memory is not entirely dedicated to Dask workers, fastapi service with its process also require some.
### Run the service locally
1. Create virtual environment in the wellbore project directory. This will create a folder inside of the wellbore project directory. For example: ~/os-wellbore-ddms/nameofvirtualenv
...
...
@@ -88,6 +99,12 @@ Wellbore Domain Data Management Services (Wellbore-DDMS) Open Subsurface Data Un
pip install-r requirements.txt
```
Or, for a developer setup, this will install tools to help you work with the code.
# Reserved character are listed here https://community.opengroup.org/osdu/documentation/-/blob/master/platform/tutorials/core-services/SearchService.md