I hereby propose the final design to defeat all cloud technical debt once and for all. Yes, you heard that right!
- All async endpoints (training, export, and inference, which is to be made async) will change their POST handlers to publish a message containing the JSON body to the Pub/Sub queue, instead of uploading the body to GCS and calling the jobs over gRPC.
- The Cloud Run job will run and, once it completes, write the result to Firestore or GCS (depending on artefact type and size).
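The two steps above can be sketched roughly as follows. This is only a minimal illustration, not the actual service code: the Pub/Sub queue and the Firestore/GCS result store are replaced by in-memory stand-ins, and `handle_post`, `run_job_once`, `job_queue`, and `result_store` are hypothetical names made up for the example.

```python
import json
import queue
import uuid

# In-memory stand-ins (assumptions): a real deployment would publish to a
# Pub/Sub topic and write results to Firestore or GCS instead.
job_queue: "queue.Queue[str]" = queue.Queue()
result_store: dict = {}


def handle_post(job_type: str, body: dict) -> dict:
    """POST handler: enqueue the JSON body and return a job id right away."""
    job_id = uuid.uuid4().hex
    message = json.dumps({"job_id": job_id, "job_type": job_type, "body": body})
    job_queue.put(message)
    return {"job_id": job_id}


def run_job_once() -> None:
    """Job: take one message, do the work, write the result to the store."""
    message = json.loads(job_queue.get())
    # Placeholder for the real training/export/inference work.
    result = {"job_type": message["job_type"], "echo": message["body"]}
    result_store[message["job_id"]] = {"status": "done", "result": result}
```

The point of the sketch is that the POST handler does nothing except enqueue and return an id, so the request body travels with the message and no separate upload or gRPC call is needed.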
I cannot find a better way to optimise step 2 than keeping the polling method. Alternatives such as webhooks are more complex than such a simple use case needs.
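For reference, the polling side can stay very small. The sketch below assumes a hypothetical `fetch` callable that returns the stored result, or `None` while the job is still running; the interval and timeout values are placeholders, not agreed numbers.

```python
import time
from typing import Callable, Optional


def poll_for_result(
    fetch: Callable[[], Optional[dict]],
    interval_s: float = 1.0,
    timeout_s: float = 60.0,
    sleep: Callable[[float], None] = time.sleep,
) -> dict:
    """Call fetch() until it returns a result or timeout_s has elapsed."""
    deadline = time.monotonic() + timeout_s
    while True:
        result = fetch()
        if result is not None:
            return result
        if time.monotonic() >= deadline:
            raise TimeoutError("job did not finish before the timeout")
        sleep(interval_s)
```

In practice `fetch` would wrap a read of the Firestore document or GCS object for the job; `sleep` is injectable only so the loop can be tested without waiting.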