
Research Scientist, Vision Language and Multimodal Modeling
- Sydney, NSW
- Permanent
- Full-time
- PhD degree in Computer Science, a related field, or equivalent practical experience.
- One or more scientific publication submissions for conferences, journals, or public repositories (such as CVPR, ICCV, NeurIPS, ICML, ICLR, etc.).
- Experience in areas like face anti-spoofing, biometrics, 3D/2.5D vision, facial landmark/pose estimation.
- Experience with TensorFlow, Flume, common computer vision libraries/frameworks and Android.
- Excellent software engineering skills (e.g., C++, python, data processing, production backend development, etc.).
- Author research papers to share and generate impact of research results across the team and in the research community.
- Help in growing research business across teams by sharing research trends and best practices within the community.
- Define the data structure, framework, design, and evaluation metrics for research solution development and implementation under minimal guidance. Identify timelines and obtain resources needed.
- Identify new and upcoming research areas by interacting with external and internal collaborators. Help in developing research strategy and plans to expand the impact of Google research with some guidance.
- Contribute to conducting experiments based on the research question. Develop research prototypes or conduct simulations to further evaluate the impact of research, finalize hypotheses, and refine the research methodology under minimal guidance.