Dear Members,
My name is Chirag, and I’m currently working on a personal project where I am exploring the integration of AI/ML capabilities, specifically focusing on image-to-text conversion. While researching Google Cloud Platform’s offerings, I came across Vertex AI, and I am eager to learn more about its capabilities in this domain.
I have a few questions regarding Vertex AI, particularly in the context of image-to-text functionality. Could you kindly share insights on the following aspects:
-
Model Training:
- How does the model training process work in Vertex AI for image-to-text applications?
- Are there specific steps or considerations that I should be aware of during the training phase?
-
Deployment of Models:
- After successfully training a model, what do we receive? Is it a code snippet that can be integrated into our backend, similar to the approach used with Google’s Teachable Machine? (https://teachablemachine.withgoogle.com/ for reference)
- Can you provide guidance on the deployment process and any best practices to follow?
-
Integration with Backend:
- Is there a recommended way to integrate the trained model with a backend system?
- Are there any specific APIs or SDKs that streamline this process?
- Also is there any specific language that I should focus more on for backend?
-
Customization of Extraction Locations:
- Can Vertex AI be configured to extract text only from specific locations within an image?
- Are there options to define regions of interest for text extraction?
I appreciate your expertise and insights as I navigate through this exploration. As mentioned earlier, this is my first venture into Google Cloud Platform, and any friendly advice or tips you can provide would be immensely helpful.
I am also open for any collaborations, if interested contact me at
Thank you for your time, and I look forward to learning from the community’s experiences.
Best regards,