Processing aborted repeatedly due to "Authorization token expired"

jeroen.dries · 15 November 2021 11:53

Not entirely in the sense that you’ll still get authentication issues when EGI is down.
execute_batch indeed sometimes fails due to authentication or some other server unavailability, but in that case, the job can still succeed, and be inspected with:

job = connection.job(job_id)
job.status()

(which will still give you an authentication exception when EGI is down)

Anyway, for a more technical discussion and solution proposals, see this issue:

github.com/Open-EO/openeo-python-driver

Better handling of HTTP issues/timeouts when resolving OIDC access tokens

opened 10:33AM - 15 Nov 21 UTC

soxofaan

Issue raised in openEO Platform forums: > 0:01:01 Job ‘vito-ec7c5c49-a54f-422…1-b659-e3e18ecb1fbd’: queued (progress N/A) > 0:01:11 Job ‘vito-ec7c5c49-a54f-4221-b659-e3e18ecb1fbd’: queued (progress N/A) > 0:01:24 Job ‘vito-ec7c5c49-a54f-4221-b659-e3e18ecb1fbd’: queued (progress N/A) > OpenEoApiError: [500] unknown: [403] TokenInvalid: Authorization token has expired or is invalid. Please authenticate again. in application logs I found around time of that job: > [2021-11-11 16:55:37,660] 9 WARNING in openeo_driver.users.auth: Failed to resolve OIDC access token > ... > requests.exceptions.ConnectionError: HTTPSConnectionPool(host='aai.egi.eu', port=443): Max retries exceeded with url: /oidc/.well-known/openid-configuration (Caused by NewConnectionError('<urllib3.connection.HTTPSConnection object at 0x7f90df136910>: Failed to establish a new connection: [Errno 110] Connection timed out')) If aai.egi.eu is (partially) down, resolving the access token fails what could be improved: - openeo-python-driver: at least make error clearer that the problem is with the identity provider, not the access token itself - openeo-python-driver: add a bit of retry logic to cover temporary glitches - openeo-python-client: don't stop the batch job status poll loop when such a temp glitch happens