First American Financial Corporation

REMOTE Principal Data Scientist

Santa Ana, California

Job ID: R037268
Date posted: Oct. 04, 2022
Category: Data Management & Analytics
Employment Type: Full Time

Company Summary

Come join First American's Digital Title Group, newly formed to re-imagine and digitize the title search and examination process through Big Data, AI, document automation and modern, cloud-native application development. As a market-leading title insurance company, powered by the nation's largest and most complete property information, ownership and recorded document database, First American is committed to advancing title automation and removing friction from the real estate closing process. Our modern title decisioning solutions create certainty and speed through data and analytics, delivered to real estate agents, lenders, title agents and homebuyers.

Join a team that puts its People First! Since 1889, First American (NYSE: FAF) has held an unwavering belief in its people. They are passionate about what they do, and we are equally passionate about fostering an environment where all feel welcome, supported, and empowered to be innovative and reach their full potential. Our inclusive, people-first culture has earned our company numerous accolades, including being named to the Fortune 100 Best Companies to Work For® list for seven consecutive years. We have also earned awards as a best place to work for women, diversity and LGBTQ+ employees, and have been included on more than 50 regional best places to work lists. First American will always strive to be a great place to work, for all. For more information, please visit

Job Summary

First American's Digital Title Group is newly formed to drive a generational paradigm shift in the way real estate is transacted. Gone will be the days of time-consuming manual title search and examination for every property. We are re-architecting how our industry works by leveraging our unique living title data, automated search and examination processes, and data-driven risk decisioning. Our approach will remove friction from the real estate closing process, create certainty and speed through data and analytics, and offer a seamless delivery experience to real estate agents, lenders, title agents and homebuyers. This will not only transform our industry, but also directly impact more than 90% of First American's $9B+ in annual revenues. It is a mission and team in which we will invest hundreds of millions of dollars over the next 3-5 years.

We are looking for a Principal Data Scientist to build and deploy natural language processing (NLP) models using a variety of machine learning and deep learning techniques. The role offers opportunities to work on large datasets and to apply innovative techniques in artificial intelligence, ranging from NLP methods to computer vision and deep learning, to enable solutions that directly impact our customers.


  • Perform exploratory analysis, construct data pipelines, and build machine learning models end-to-end, from POC to deployment, for large-scale production systems
  • Monitor, maintain, optimize and continuously improve the deployed machine learning solutions during day-to-day operations
  • Deploy models through Docker containers on AWS/GCP/Azure that serve real-time and batch prediction results for various business functions
  • Build scalable models with a feedback collection system that enables automatic retroactive training of machine learning models
  • Tune model performance to reduce computation time and cost
  • Establish an MDM system to track model performance and generate alerts through MLflow, AWS SageMaker, etc.


  • Extensive experience developing end-to-end machine learning solutions and leading solution diagnosis, including designing & architecting machine learning models that solve business problems & fit into the overall engineering framework, as well as experimentation, model pipeline build, performance optimization, integration and deployment
  • Knowledge of and experience with machine learning, NLP & deep learning techniques
  • Demonstrated proficiency with programming languages such as Python & SQL
  • Familiarity with MLOps & common MLOps toolkits, e.g., MLflow, AWS SageMaker, etc.
  • Familiarity with engineering toolkits that are frequently used in machine learning model deployment, e.g., Git, Docker, CI/CD pipelines, Airflow, AWS EC2, AWS ECS, etc.
  • Familiarity with large-scale data processing techniques & tools, e.g., multi-threaded computing, GPU computing, distributed computing in PySpark, etc.
  • Experience with data engineering, end-to-end pipeline building, engineering integration, model performance monitoring and continuous model improvement
  • PhD in a quantitative field such as Mathematics, Statistics or Computer Science with 7+ years of related work experience, or MS with 10+ years of related post-graduate work experience


First American invests in its employees' development and well-being, empowers them to provide superior customer service and encourages them to serve the communities where they live and work. First American is committed to diversity and inclusion. We are an equal opportunity employer.

Based on eligibility, First American offers a comprehensive benefits package including medical, dental, vision, 401(k), PTO/paid sick leave and other great benefits like an employee stock purchase plan.
