Troubleshooting#
Credentials#
Below are some common errors you might run into when using the credential CLI. Please follow the suggested steps to troubleshoot. Please also refer to Setup Credentials for more information
Could not connect to the endpoint URL#
Please confirm if the data endpoint URL is valid
Extra fields not permitted#
Please make sure you don’t provide extra field when setting credentials with payload. The tabulated information illustrates the keys that are compulsory and those that are optional for the payload corresponding to each type of credential.
SignatureDoesNotMatch#
Please check if you access_key_id and access_key are valid.
Max retries exceeded with url#
Please check if your registry is valid.
Registry authentication failed#
Please check if you registry username and auth is valid.
Duplicate key value#
Please rename your credential or delete it with $ osmo credential delete <your_cred_name>
and then reset it.
Dataset#
Below are some common errors you might run into when using the dataset CLI. Please follow the suggested steps to troubleshoot. Please also refer to Data or Working with Data for more information.
Validation error#
Please confirm if the access_key_id set for your data credentials is the same as the Shared
Storage S3 ACL Access User found at Data
If the access_key_id does not have the correct permissions, ask an admin for permission.
No default bucket#
Please set a default bucket as specified at Data
Resources#
Please make change to the workflow resource specs based on the detailed error message. Please
also refer Overview to make sure your resource spec is correct. Set the labels,
cpu/gpu, memory, storage based on the current pool/platform availability osmo resource list
Some common errors are listed below:
Too high for label memory#
Please check the available memory and set it correctly.
Too high for label cpu#
Please check the available cpu and set it correctly.
Too high for label storage#
Please check the available storage and set it correctly.
Does not allow mount#
If you need specific host mounts, reach out to admin to update the platform configs.
Workflow#
When a workflow fails, refer to query to gain an overview of the workflow tasks statuses on which pods failed as well as their failure messages. Refer to Status Reference that contains more information regarding different workflow statuses.
Use logs for a better insight of what happened during the workflow runtime.
137 Error Code#
When a task exits with exit code 137, it usually signifies that your task was killed due to
using too much memory.
A user can confirm this if the admins have setup a Grafana Dashboard for detailed
workflow usage information. To see the dashboard, users can click on the Resource Usage button
in the UI on the detailed workflow information page.
To resolve the memory issue, users can try increasing the amount of memory requested or lower the memory usage within the task. To learn more about workflow resources, refer to Overview.