Janak Kapuriya

AI/ML Engineer &
Applied Researcher

  Resume
  Linkedin
  Github
  Scholar

Hello!

I am an AI/ML Software Engineer and Applied Researcher specializing in Natural Language Processing (NLP), Generative AI, and Large Language Models (LLMs). With an M.Tech in CSE with a specialization in Artificial Intelligence and hands-on experience in end-to-end ML development, I bridge the gap between cutting-edge AI research and production-ready industrial applications.

  • Location: Galway, Ireland (Open to Relocate)
  • Legal Status: Stamp 1 (Stamp 4 Under Processing)
  • Work Preference: On-site, Hybrid, or Remote

Work Experience

Research Assistant (Machine Learning Engineer)
Insight Research Ireland Centre for Data Analytics, University of Galway | Sep 2024 - Present
  • Developed and deployed Multi-Agent Systems for automatic medical claim approval using PydanticAI, Python, RAG, LLaMA3, PostgreSQL, and Microsoft Azure. .
  • Developed an NLP system to extract key entities, topics, and relationships from various documents (PPT/Images/PDFs) using state-of-the-art open-source LLMs (Mistral-7B, LLaMA3) and Vision-Language models (Pixtral-11B), improving accuracy to 90%.
  • Owned technical delivery by containerizing ML models with Docker and deploying them as scalable REST APIs on Microsoft Azure cloud infrastructure.
  • Published research papers at top AI conferences and workshops (SIGIR, ICLR, ECIR).
Research Intern
National Institute of Informatics (NII Japan) | Mar 2024 - Apr 2024
  • Engineered a novel Semantic Frame Aggregation-based Transformer to automate live comment generation from streaming videos.
  • Outperformed state-of-the-art baselines, achieving a 4% performance improvement in Recall@1 on the LiveChat dataset.
  • Work published in the Q1 journal IEEE Transactions on Multimedia (Impact Factor: 9.7)

Education

  • M.Tech in CSE (Artificial Intelligence)
    Indraprastha Institute Of Information Technology Delhi (IIIT Delhi) | Aug 2022 - June 2024
    GPA: 8.27 / 10.00
  • B.E in Computer Engineering
    Gujarat Technological University (GTU) | Aug 2017 - June 2021
    GPA: 9.32 / 10.00

Technical Skills

  • Multi-Agent Systems & Agentic Workflows: Pydantic AI, LangChain, LangGraph
  • Enterprise RAG Pipelines & Vector Databases: Pinecone, ChromaDB, FAISS
  • Open-Source LLM Fine-Tuning & Alignment: Deepseek, Qwen, Mistral, LLaMA3, RLHF/PPO/DPO
  • LLMs APIs: OpenAI, Claude, Groq, Deepseek, Openrouter, LLaMA
  • End-to-End ML Deployment: Docker, Microsoft Azure, PySpark, CI/CD pipelines, Flask REST APIs

Publications

TopoBench: Benchmarking LLMs on Hard Topological Reasoning
Mayug Maniparambil, Nils Hoehing, Janak Kapuriya, Arjun Karuvally, Ellen Rushe, Anthony Ventresque, Noel O'Connor, Fergal Reid
Logical Reasoning of LLMs @ ICLR 2026
Preprint

A Progressive Evaluation Framework for Multicultural Analysis of Story Visualization
Janak Kapuriya, Ali Hatami, Paul Buitelaar
Arxiv Preprint 2025
paper

Enhancing Scientific Visual Question Answering via Vision-Caption aware Supervised Fine-Tuning
Janak Kapuriya, Anwar Dilawar Shaikh, Arnav Goel, Medha Hira, Apoorv Singh, Jay Saraf, Sanjana Sanjeev, Vaibhav Nauriyal, Avinash Anand, Zhengkui Wang, Rajiv Ratn Shah
LAVA @ ACM Multimedia 2025
paper

Semantic Frame Aggregation-based Transformer for Live Video Comment Generation
Anam Fatima, Yi Yu, Janak Kapuriya, Julien Lalanne, Jainendra Shukla
IEEE Transaction on Multimedia 2025
paper

Exploring the Role of Diversity in Example Selection for In-Context Learning
Janak Kapuriya, Manit Kaushik, Debasis Ganguly, Sumit Bhatia
SIGIR 2025 | Special Interest Group on Information Retrieval
paper

FlintstonesSV++ : Improving Story Narration using Visual Scene Graph
Janak Kapuriya, Paul Buitelaar
Text2Story @ ECIR 2025 | European Conference on Information Retrieval
paper

Spiritual-LLM : Gita Inspired Mental Health Therapy In the Era of LLMs
Janak Kapuriya, Aman Singh, Jainendra Shukla, Rajiv Ratn Shah
Arxiv Preprint 2025 | Under Review
paper

MM-PhyQA: Multimodal Physics Question-Answering with Multi-image CoT Prompting
Avinash Anand Janak Kapuriya, Apoorv Singh, Jay Saraf, Naman Lal, Astha Verma, Rushali Gupta & Rajiv Shah
PAKDD 2024 | Pacific-Asia Conference on Knowledge Discovery and Data Mining
paper

Deep Learning Based Named Entity Recognition Models for Recipes
Mansi Goel*, Ayush Agarwal*, Shubham Agrawal*, Janak Kapuriya*, Akhil Vamshi Konam*, Rishabh Gupta, Shrey Rastogi, Niharika Niharika, Ganesh Bagler | (*Equal Contribution)
LREC-COLING 2024 | Joint Int. Conference on Computational Linguistics, Language Resources and Evaluation
paper


Mentorship & Leadership

  • Winter 2024: Teaching Assistant for CSE508: Information Retrieval (IIIT-Delhi)
  • Monsoon 2022: Teaching Assistant for CSE201: Advance Programming (IIIT-Delhi)

  Template: Ashish Sharma