Data Scientist – AI / LLM

Diagonal Matrix Ltd

Full-time | Contract

Remote

Job description

Overview

We are seeking a motivated Data Scientist with AI and Large Language Model experience to join our growing technology team in the USA. This role is ideal for recent graduates or early-career professionals who are passionate about data science, machine learning, artificial intelligence, and LLM-based applications.

The successful candidate will work on real-world AI and data science projects involving data analysis, predictive modelling, natural language processing, generative AI, Retrieval-Augmented Generation, and LLM-powered automation. This is an excellent opportunity for candidates on OPT, CPT, or STEM OPT who want to build strong commercial experience in the AI and data science industry.

Responsibilities

Design, develop, and evaluate machine learning models for business and customer-focused use cases.

Work with structured and unstructured datasets to identify patterns, trends, insights, and predictive signals.

Build data science solutions using Python, SQL, Pandas, NumPy, Scikit-learn, and related libraries.

Support the development of AI and LLM-based applications using tools and frameworks such as OpenAI, Azure OpenAI, LangChain, Hugging Face, and vector databases.

Assist in building Retrieval-Augmented Generation pipelines using embeddings, document chunking, vector search, and semantic retrieval.

Develop and test prompts for LLM applications, including summarisation, classification, question answering, information extraction, and chatbot workflows.

Collaborate with data engineers, software developers, and business stakeholders to understand requirements and convert them into data-driven solutions.

Perform exploratory data analysis, data cleaning, feature engineering, model training, validation, and performance evaluation.

Work with cloud platforms such as AWS, Azure, or Google Cloud for data storage, AI services, and model deployment.

Create dashboards, reports, and visualisations to communicate insights clearly to technical and non-technical audiences.

Monitor model performance and support improvements to accuracy, reliability, explainability, and responsible AI practices.

Maintain clear documentation of models, datasets, experiments, assumptions, and technical processes.

Required Qualifications

Bachelor’s or Master’s degree in Data Science, Computer Science, Artificial Intelligence, Machine Learning, Statistics, Mathematics, Engineering, or a related field.

Recent graduate or early-career professional with academic, internship, project, or commercial experience in data science or AI.

Strong understanding of machine learning concepts such as supervised learning, unsupervised learning, classification, regression, clustering, and model evaluation.

Good programming skills in Python.

Good knowledge of SQL for querying and analysing data.

Experience with common data science libraries such as Pandas, NumPy, Scikit-learn, Matplotlib, or similar tools.

Understanding of data preprocessing, feature engineering, model training, and evaluation techniques.

Basic understanding of Large Language Models, Generative AI, NLP, embeddings, vector databases, or RAG pipelines.

Strong analytical thinking and problem-solving skills.

Good communication skills with the ability to explain technical findings in a simple and clear way.

Preferred Qualifications

Hands-on experience with OpenAI, Azure OpenAI, Anthropic, Hugging Face, LangChain, LlamaIndex, or similar AI/LLM tools.

Experience working with vector databases such as Pinecone, FAISS, ChromaDB, Weaviate, or Qdrant.

Experience with cloud platforms such as AWS, Azure, or Google Cloud.

Knowledge of data visualisation tools such as Power BI, Tableau, Looker, or Streamlit.

Experience with Git, APIs, Docker, FastAPI, or basic software development practices.

Academic or personal projects involving chatbots, document Q&A systems, recommendation engines, forecasting, NLP, computer vision, or predictive analytics.

Understanding of responsible AI, bias, model explainability, data privacy, and AI safety principles.

Visa / Work Authorization

We welcome applications from candidates who are currently authorised to work in the United States, including eligible F-1 students and recent graduates on CPT, OPT, or STEM OPT, subject to applicable university and immigration rules.

Candidates must have valid work authorization before starting employment. For CPT candidates, university approval and CPT authorization must be completed before the employment start date.

Job Types: Full-time, Contract

Pay: $75,000.00 - $125,000.00 per year

Application Question(s):

Are you currently based in the United States?
Are you currently on CPT, OPT, STEM OPT, H-1B, Green Card, US Citizen, or another work authorization category?
Do you require employer sponsorship now or in the future?
Are you available for full-time employment?
When can you start?
What is your Visa? eg: CPT, OPT, STEM OPT, H-1B

Work Location: Remote

Apply