Applied AI/ML Engineer • Software Engineering Leader
Applied AI/ML engineer with hands-on experience fine-tuning and deploying transformer-based models (LLMs, BERT, GPT, T5), including RAG pipelines, AI agents, and OpenAI-compatible systems. Strong software engineering background building end-to-end ML pipelines, inference services, and workflow automation using Python, Go, Docker, and Kubernetes. Currently pursuing a BS in Computer Science (AI/ML focus) at Northern Arizona University with a GPA above 3.7, graduating May 2026.
Applied AI/ML Engineer with proven expertise in transformer models, RAG pipelines, and production systems
Currently working as a Junior AI/ML Engineer in Flagstaff, AZ, while pursuing my Bachelor's in Computer Science (AI/ML Engineering focus) at Northern Arizona University (GPA >3.7, graduating May 2026). I specialize in fine-tuning transformer-based models across multiple architectures (encoder, decoder, encoder-decoder), building RAG (Retrieval-Augmented Generation) systems with vector search and contextual retrieval, and developing AI agents for automation and reasoning tasks. My expertise spans end-to-end ML pipelines covering data preprocessing, model training, evaluation, and deployment using PyTorch, Hugging Face Transformers, and PEFT techniques. I translate real-world problems into scalable AI solutions through experimentation, evaluation, and iterative delivery.
My professional and academic journey
Transformer Architecture Expertise: Fine-tuned and evaluated large language models (LLMs) across encoder, decoder, and encoder-decoder transformer architectures using PyTorch, Hugging Face Transformers, and PEFT techniques.
End-to-End ML Pipelines: Designed and implemented AI/ML pipelines covering data preprocessing, model training, evaluation, and inference deployment for production environments.
RAG & AI Agents: Built Retrieval-Augmented Generation (RAG) systems integrating vector search, contextual retrieval, and OpenAI-compatible APIs. Developed AI-driven workflows and agent-based applications to support reasoning, automation, and user-facing interactions.
Multi-Modal Transformer Fine-tuning: Fine-tuned encoder, decoder, and encoder-decoder transformers for NLP and multi-modal data processing, optimizing model performance and inference efficiency.
RAG System & OpenAI API: Built a Retrieval-Augmented Generation (RAG) system and developed an OpenAI-compatible API for seamless integration and user context management in production environments.
Significant Codebase Contributions: Contributed 50K+ lines of code and refactored 30K+ lines across internal ML tooling and APIs, applying software engineering best practices.
Comprehensive Technical Documentation: Authored 150+ pages of ML/LLM technical documentation covering architecture, training, evaluation, and deployment.
Property Management System (PMS): Developed and optimized a Property Management System to improve workflows and maintainability using modern software engineering practices.
Container Orchestration: Applied Docker and Kubernetes to streamline deployment processes, ensuring consistent and scalable infrastructure.
Cross-functional Collaboration: Collaborated with cross-functional teams to deliver stable and reliable solutions, maintaining high code quality standards.
First Professional Experience: Secured first internship during sophomore year, marking entry into professional software engineering.
Go Development: Delivered features and bug fixes in Go for enterprise software, focusing on reliability and scalability.
Production Systems: Ensured reliability and scalability in production systems through rigorous testing and code review processes.
Software Engineering Fundamentals: Gained hands-on experience with version control, CI/CD pipelines, and collaborative development workflows.
Led afternoon shift operations, managing team coordination and service delivery in a fast-paced hospitality environment. Developed leadership, customer service, and multitasking skills while maintaining high service quality standards.
Began working at age 16 during high school summer breaks. Developed a strong work ethic, team collaboration skills, and discipline while working in physically demanding construction environments. Assisted with site preparation and equipment handling, and adhered to safety protocols.
Focus: AI/ML Engineering with emphasis on production systems and scalable architectures.
Relevant Coursework: Artificial Intelligence, Machine Learning, Unsupervised Machine Learning, Deep Learning, Natural Language Processing.
Programming Languages: C, C++, Java, Python, Go, TypeScript, Bash, and more.
Academic Projects: Developed ML models for anomaly detection and reinforcement learning for motion planning.
Led teams of 6+ members in project design and delivery under time pressure across multiple hackathon events. Applied agile collaboration, pair programming, and effective project coordination to deliver innovative solutions. Demonstrated strong leadership and team management skills in high-stakes competitive environments.
Serve as a liaison between students and faculty to improve academic programs and the student experience. Advocate for peers, coordinate departmental initiatives, and facilitate communication across the Computer Science department.
Create accessible educational videos on Computer Science and AI/ML for broad audiences. Transform complex AI/ML and software engineering topics into engaging, digestible content. Focus on making technical concepts accessible while maintaining accuracy and depth.
Technologies and tools I've mastered
Professional testimonials from colleagues and supervisors
Technology Director at Imascono
Development of Innovative Products
"Juan José has been the best intern I've had in the last 5 years. From his very first week with us, he made an impressive impact: he quickly grasped the key concepts of LLMs and fine-tuning, and successfully debugged and optimized a fine-tuning project. Since then, he has continued to evolve the architecture, code, training datasets, and documentation."
"During this time, Juan José has consistently demonstrated hard work, strong skills in AI development, and a remarkable blend of passion and creativity."
DevOps Engineer at Trioteca
"I had the pleasure of working with Juan José at Entertainment Solutions, where he contributed to our software development team. Throughout his time with us, he demonstrated a strong foundation in programming, problem-solving, and software design."
Let's discuss how I can contribute to your AI/ML projects
jserranom04@gmail.com
+34 622 48 11 21
Flagstaff, AZ
Working for Imascono