🚀 Hiring: 2 Senior R&D Contractors (Multimodal AI – Image + Text)
💼 Contract: 3–6 Months | T&M | Staff Augmentation
🌐 Remote | 🕓 Immediate Start
Our client is building a next-generation multimodal AI model combining biometric image data (e.g., tongue images) with text-based clinical descriptions and diagnoses.
We are in the R&D phase and require highly skilled senior engineers who can build clinically meaningful, interpretable, and robust models from the ground up.
1️⃣ Senior Computer Vision / Image Processing Engineer
Role Overview
Own the complete image pipeline—preprocessing, color correction, segmentation, detection, and dataset annotation workflow design.
Key Responsibilities
- Implement advanced color correction (Gray World, Retinex, SVM/SVR-based).
- Perform morphological operations (erosion, dilation, opening/closing, interpolation).
- Expert-level segmentation using:
- Classical: Active Contour/Snakes, Watershed, Graph Cuts, Region Growing
- Deep learning: UNet, UNet++, DeepLab, Mask R-CNN
- Customize and fine-tune YOLO-family detection models.
- Work with large datasets; define annotation protocols, tools, and taxonomies.
- Ensure image pipeline supports clinical interpretability & robustness.
Required Skills
- Python, OpenCV, NumPy
- PyTorch / TensorFlow
- UNet / UNet++ / DeepLab / Mask R-CNN
- Classical segmentation (watershed, snakes, graph cuts)
- Color correction methods
- Morphological operations
- Experience with Labelbox / CVAT / Supervisely
- Designing taxonomies for clinical datasets
2️⃣ Senior Machine Learning / Deep Learning Engineer
Role Overview
Lead development of fusion models combining image + text modalities with custom neural architectures and interpretability-first design.
Key Responsibilities
- Architect & optimize complex neural networks: CNNs, GCNs, KAN, interpretable models.
- Fine-tune pre-trained models for clinical datasets.
- Perform feature selection (LASSO, Random Forest, SHAP).
- Build cross-modal fusion models (image + text) using:
- Custom loss functions (alignment, consistency loss)
- Attention-based fusion mechanisms
- Evaluate using advanced metrics:
- Segmentation: IoU, Dice, Hausdorff
- Detection: mAP, Precision–Recall
- Classification: F1, ROC, multi-label metrics
- Guide annotation strategy for multi-label, clinically meaningful datasets.
- Ensure models are interpretable, reliable, and clinically relevant.
Required Skills
- Text + Image fusion
- CNNs, GCNs, KAN
- Custom loss function design
- Attention mechanisms
- Feature selection frameworks
- Multi-label classification expertise
- Deep metric understanding (IoU, Dice, mAP, PR, ROC)
📩 Apply / Refer Immediately
📧 Email: riyaz.ltpl@gmail.com
📱 WhatsApp: +91 9595175635
💬 WhatsApp Group:
https://chat.whatsapp.com/HllHQes4d2QLFyukXjsIZv
https://whatsapp.com/channel/0029Vb7OZnz65yDDa1kT6i0p
🔗 Telegram Channel: Rozgaar Ki Nayi Duniya
🌐 Blog: WorkNestProX.com
🔥 Trending Hashtags for Maximum Reach
#ServiceNow #ServiceNowJobs #IRM #GRC #RiskManagement #PolicyAndCompliance
#ServiceNowDeveloper #NowHiring #RemoteJobs #TechJobs #HiringAlert
#ImmediateJoiners #WorkNestProX #IndiaJobs #SNOWJobs #PlatformDeveloper
#ITJobsIndia