Job Description: Bankai Labs, the AI and deep tech division of Bankai Groups [Panamax], is looking for a Senior Data Scientist with extensive experience of 4-5+ years in Data Science, Machine Learning (ML), Natural Language Processing (NLP), Computer Vision, Gen AI (LLMs).
The ideal candidate should demonstrate expertise in open-source frameworks, hybrid neural network architectures, and optimization techniques for GPU-based computation.
This role emphasizes designing and deploying scalable AI/ML solutions for cloud and on-premise environments while leveraging advanced mathematical concepts to drive innovation and integration into existing business solutions to enhance user experience.
Key Responsibilities: Design and optimize NLP solutions: Develop domain-specific AI/ML solutions using open-source frameworks like Hugging Face Transformers, Spa Cy, and Fast Text.
Hybrid Neural Network Architectures: Innovate by building hybrid architectures that combine pre-existing or self-developed backbones for advanced problem-solving.
Computer Vision Applications: Implement state-of-the-art algorithms and techniques using libraries such as Tensor Flow, Py Torch, and Open CV.
Mathematical Modeling: Apply vector and differential calculus, including parametric and complex parametric functions, to model and solve advanced AI problems.
GPU Optimization: Optimize neural network code for GPU architectures, facilitating efficient use of accelerators like NVIDIA GPUs and integrating Triton backends with Py Torch.
Integration and Deployment: Use tools like Fast API, Streamlit, and Flask to deploy AI/ML models and integrate them into user-friendly applications; Develop SDK to integrate into existing applications of the business Mentorship and Leadership: Guide and mentor a data science team, fostering a collaborative environment and championing the use of open-source technologies.
Stay Current with AI Trends: Continuously explore advancements in AI/ML frameworks, libraries, and deployment methodologies, ensuring cutting-edge solutions.
Maintain and deploy: Git repo management and technical documentation and user documentation Tech Stack Requirements: Core Expertise: Proficiency in open-source NLP frameworks such as Hugging Face Transformers , Spa Cy , and Fast Text.
Extensive experience with Large Language Models (LLMs) like GPT , BERT , or T5 using open-source implementations.
Advanced knowledge of Computer Vision frameworks such as Py Torch , Tensor Flow , and Keras including knowledge of OCR models.
GANs/stable diffusion for advanced computer vision and distributed training for large-scale AI models.
Hybrid Neural Network Architectures: Proven experience in combining multiple neural network backbones and building innovative architectures tailored to specific use cases.
Programming and Frameworks: Strong proficiency in C , C++ , or Rust for performance-critical AI/ML applications.
Solid understanding of the internal workings of Py Torch and Tensor Flow and their integration with Triton for optimized execution.
Mathematical Foundations: In-depth understanding of vector and differential calculus , with specific expertise in parametric and complex parametric functions.
Deployment Tools: Proficiency in using Fast API , Streamlit , and Flask for deploying and integrating AI models into production-ready systems.
GPU and Hardware Optimization: Fundamental knowledge of GPU architectures and techniques to optimize code for accelerators.
Qualifications and Skills: Education: Bachelor’s, Master’s or Ph.
D.
in Computer Science, Data Science, Applied Mathematics, or a related field.
Experience: 4-5+ years of experience in data science roles, with a focus on Numpy, Pandas, Matplotlib, Plotly, SLM/LLM, NLG and Computer Vision.
Expertise in building and deploying cloud AI/ML solutions like AWS, GCP or Azure and On-premises.
Strong analytical thinking and problem-solving skills with a focus on open-source innovation.
Excellent team collaboration and leadership abilities.
Dockerization and Fast API development and integration Proven experience in building data pipelines like Databricks, Synapse Preferred Skills: Experience with hybrid deployment models combining on-premise and cloud-based components using open-source tools.
Familiarity with CI/CD pipelines and MLOps for managing AI models using open-source platforms like MLflow.
Working knowledge of real-time data processing tools such as Kafka or Fluentd.
Familiarity with SLMs and conversation chat flow builder Familiarity with virtual assistant application development Familiarity with Fin Tech domain for AI based banking solutions Why Join Bankai Labs?Work on transformative AI projects leveraging open-source technologies.
Collaborate with a team that values innovation, scalability, and optimization in AI/ML development.